* 



1 TCCTCCGGTC GCCCGCCCTC GGGGCAGCTA GTGGCGCAGC CCCCCGCCCG 

51 CGGCCCTGGC CTCCCGGGCG GCGCGGCAGG GGAGGGGTTA AGCTGCCGCA 

101 GGGACCGCCG CGTGCGGGGC GAGAGGGAGC CCCCGGTGGG GGTGGCGCAG 

151 CCGGCGGGGT TCGGTCCGAG CCCGGTGGGA GGCTCCCGGA GCGCAGCCTG 

201 GGCCCAGCCC ACCCCGCGCC GGCGGCCATG GCAGGCACCC TGGACCTGGA 

251 CAAGGGCTGC ACGGTGGAGG AGCTGCTCCG CGGGTGCATC GAAGCCTTCG 

301 ATGACTCCGG GAAGGTGCGG GACCCGCAGC TGGTGCGCAT ATTCCTCATG 

351 ATGCACCCCT GGTACATCCC CTCCTCTCAG CTGGCGGCCA AGCTGCTCCA 

4 01 CATCTACCAA CAATCCCGGA AGGACAACTC CAATTCCCTG CAGGTGAAAA 

451 CGTGCCACCT GGTCAGGTAC TGGATCTCCG CCTTCCCAGC GGAGTTTGAC 

501 TTGAACCCGG AGTTGGCTGA GCAGATCAAG GAGCTGAAGG CTCTGCTAGA 

551 CCAAGAAGGG AACCGACGGC ACAGCAGCCT AATCGACATA GACAGCGTCC 

601 CTACCTACAA GTGGAAGCGG CAGGTGACTC AGCGGAACCC TGTGGGACAG 

651 AAAAAGCGCA AGATGTCCCT GTTGTTTGAC CACCTGGAGC CCATGGAGCT 

7 01 GGCGGAGCAT CTCACCTACT TGGAGTATCG CTCCTTCTGC AAGATCCTGT 

751 TTCAGGACTA TCACAGTTTC GTGACTCATG GCTGCACTGT GGACAACCCC 

801 GTCCTGGAGC GGTTCATCTC CCTCTTCAAC AGCGTCTCAC AGTGGGTGCA 

851 GCTCATGATC CTCAGCAAAC CCACAGCCCC GCAGCGGGCC CTGGTCATCA 

901 CACACTTTGT CCACGTGGCG GAGAAGCTGC TACAGCTGCA GAACTTCAAC 

951 ACGCTGATGG CAGTGGTCGG GGGCCTGAGC CACAGCTCCA TCTCCCGCCT 

1001 CAAGGAGACC CACAGCCACG TTAGCCCTGA GACCATCAAG CTCTGGGAGG 

1051 GTCTCACGGA ACTAGTGACG GCGACAGGCA ACTATGGCAA CTACCGGCGT 

1101 CGGCTGGCAG CCTGTGTGGG CTTCCGCTTC CCGATCCTGG GTGTGCACCT 

1151 CAAGGACCTG GTGGCCCTGC AGCTGGCACT GCCTGACTGG CTGGACCCAG 

1201 CCCGGACCCG GCTCAACGGG GCCAAGATGA AGCAGCTCTT TAGCATCCTG 

12 51 GAGGAGCTGG CCATGGTGAC CAGCCTGCGG CCACCAGTAC AGGCCAACCC 

1301 CGACCTGCTG AGCCTGCTCA CGGTGTCTCT GGATCAGTAT CAGACGGAGG 

1351 ATGAGCTGTA CCAGCTGTCC CTGCAGCGGG AGCCGCGCTC CAAGTCCTCG 

14 01 CCAACCAGCC CCACGAGTTG CACCCCACCA CCCCGGCCCC CGGTACTGGA 

1451 GGAGTGGACC TCGGCTGCCA AACCCAAGCT GGATCAGGCC CTCGTGGTGG 

1501 AGCACATCGA GAAGATGGTG GAGTCTGTGT TCCGGAACTT TGACGTCGAT 

1551 GGGGATGGCC ACATCTCACA GGAAGAATTC CAGATCATCC GTGGGAACTT 

1601 CCCTTACCTC AGCGCCTTTG GGGACCTCGA CCAGAACCAG GATGGCTGCA 

1651 TCAGCAGGGA GGAGATGGTT TCCTATTTCC TGCGCTCCAG CTCTGTGTTG 

1701 GGGGGGCGCA TGGGCTTCGT ACACAACTTC CAGGAGAGCA ACTCCTTGCG 

17 51 CCCCGTCGCC TGCCGCCACT GCAAAGCCCT GATCCTGGGC ATCTACAAGC 

1801 AGGGCCTCAA ATGCCGAGCC TGTGGAGTGA ACTGCCACAA GCAGTGCAAG 

1851 GATCGCCTGT CAGTTGAGTG TCGGCGCAGG GCCCAGAGTG TGAGCCTGGA 

1901 GGGGTCTGCA CCCTCACCCT CACCCATGCA CAGCCACCAT CACCGCGCCT 

1951 TCAGCTTCTC TCTGCCCCGC CCTGGCAGGC GAGGCTCCAG GCCTCCAGCA 

2001 ATCCCCCTCC CAGCAGAGAT CCGTGAGGAG GAGGTACAGA CGGTGGAGGA 

2051 TGGGGTGTTT GACATCCACT TGTAATAGAT GCTGTGGTTG GATCAAGGAC 

2101 TCATTCCTGC CTTGGAGAAA ATACTTCAAC CAGAGCAGGG AGCCTGGGGG 

2151 TGTCGGGGCA GGAGGCTGGG GATGGGGGTG GGATATGAGG GTGGCATGCA 

2201 GCTGAGGGCA GGGCCAGGGC TGGTGTCCCT AAGGTTGTAC AGACTCTTGT 

2251 GAATATTTGT ATTTTCCAGA TGGAATAAAA AGGCCCGTGT AATTAAAAAA 

2301 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAA (SEQ ID NO:l) 



FEATURES : 

5'UTR: 1-227 

Start Codon: 228 

Stop Codon: 2073 

3'UTR: 2076 
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# 



Homologous proteins : 

Top 10 BLAST Hits 



CRAI 1000682340958 

CRA| 18000005086608 

CRAI18000005188697 

CRAI18000005205935 

CRAI18000005188699 

CRAU8000005152782 

CRAI18000005192860 

CRAI18000005192861 

CRAI 18000005188698 

CRAI1000733831533 

BLAST dbEST hits: 



/altid-gi I 
/altid=gi 
/altid=gi 
/altid=gi 
/altid=gi 
/altid=gi 
/altid=gi 
/altid=gi 
/altid=gi 

/altid=gi I 



6358505 

15031623 

16755290 

(7662334 

13928857 

I 9507035 

17242201 

14038292 

15032025 

6650545 



/def 
/def 
/def 
/def 
/def 
/def 
/def 
/def 
/def 

/def=< 



gb|AAF07219.1 IAF043722. 
=ref |NP_005816.1| RAS . 
=ref |NP_035372.1| RAS, . 
ref |NP_056191.1| KIAA. 
gb!AAC7 9700.1j (AF081. 
ref |NP_062084.1| RAS . 
=ref |NP_035376.1! RAS . 
gb|AAC97349.1| (AF106. 
ref |NP_005730.1| RAS . 
gb| AAF21898 . 1 jAF081197 . 



git 
gil 
gil 
gil 
git 
gil 
gil 
gil 
git 
gil 



5432583 

9876673 

11286864 

11285315 

5432584 

4372300 

12295751 

12288965 

6920402 

2005039 



/dataset=dbest 
/dataset=dbest 
/dataset=dbest 
/dataset=dbest 
/ dataset^dbest 
/dataset=dbest 
/dataset=dbest 
/dataset=dbest 
/dataset=dbest 
/dataset=dbest 



/taxon=9606 
/taxon=960. . 

/taxon=96. . 

/taxon^96. . 
/taxon-9606 
/taxon=9606 

/taxon=96 . . 

/taxon=96. . 
/taxon=960. . 
/taxon=9606 



Score 




E 


. 1293 


0. 


0 


. 1241 


0. 


0 


. 1202 


0. 


0 


618 


e- 


175 


533 


e- 


150 


531 


e- 


149 


529 


e- 


149 


526 


e- 


148 


525 


e- 


148 


525 


e- 


148 


Score 




E 


1310 


0. 


0 


1281 


0. 


0 


1249 


0. 


0 


1207 


0. 


0 


733 


0. 


0 


720 


0. 


0 


700 


0. 


0 


599 


e- 


-168 


573 


e- 


-161 


573 


e- 


-161 



EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 

From BLAST dbEST hits: 

gi|5432583 Testis 

gi| 9876673 Liver-non-cancerous 

git 11286864 Brain glioblastoma 

gi| 11285315 Brain glioblastoma 

gi|5432584 Testis 

gi| 4372300 B Cell Chronic lymphatic leukemia 
gi| 12295751 Adult marrow 
gi | 12288965 Adult marrow 

gi| 6920402 Lymph germinal center B cell 
gi| 2005039 Lymph 

From tissue screening panels: 
Leukocyte 
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1 MAGTLDLDKG CTVEELLRGC IEAFDDSGKV RDPQLVRIFL MMHPWYIPSS 
51 QLAAKLLHIY QQSRKDNSNS LQVKTCHLVR YWISAFPAEF DLN PELAEQI 
101 KELKALLDQE GNRRHSSLID IDSVPTYKWK RQVTQRNPVG QKKRKMSLLF 
151 DHLEPMELAE HLTYLEYRSF CKILFQDYHS FVTHGCTVDN PVLERFISLF 
201 NSVSQWVQLM ILSKPTAPQR ALVITHFVHV AEKLLQLQNF NTLMAWGGL 
251 SHSSISRLKE THSHVSPETI KLWEGLTELV TATGNYGNYR RRLAACVGFR 
301 FPILGVHLKD LVALQLALPD WLDPARTRLN GAKMKQLFSI LEELAMVTSL 
351 RPPVQANPDL LSLLTVSLDQ YQTEDELYQL SLQREPRSKS SPTSPTSCTP 
401 PPRPPVLEEW TSAAKPKLDQ ALVVEHIEKM VESVFRNFDV DGDGHISQEE 
451 FQIIRGNFPY LSAFGDLDQN QDGCISREEM VSYFLRSSSV LGGRMGFVHN 
501 FQESNSLRPV ACRHCKALIL GIYKQGLKCR ACGVNCHKQC KDRLSVECRR 
551 RAQSVSLEGS APSPSPMHSH HHRAFSFSLP RPGRRGSRPP AIPLPAEIRE 
601 EEVQTVEDGV FDIHL (SEQ ID NO: 2) 



FEATURES: 

Functional domains and key regions: 

[1] PDOC00004 PS00004 CAMP_PHOSPHO_SITE 

cAMP- and cGMP-dependent protein kinase phosphorylation site 

Number of matches : 3 

1 113-116 RRHS 

2 144-147 RKMS 

3 584-587 RRGS 

[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 

Number of matches: 7 

1 27-29 SGK 

2 63-65 SRK 

3 126-128 TYK 

4 134-136 TQR 

5 269-271 TIK 

6 349-351 SLR 

7 506-508 SLR 

[3] PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 

Number of matches: 9 



1 


12 


-15 


TVEE 


2 


63 


-66 


SRKD 


3 


117- 


120 


SLID 


4 


163- 


166 


TYLE 


5 


339- 


342 


SILE 


6 


373- 


376 


TEDE 


7 


447- 


450 


SQEE 


8 


476- 


479 


SREE 


9 


605- 


608 


TVED 



[4] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 4 



1 
2 
3 
4 



19-24 GCIEAF 

249-254 GLSHSS 

284-289 GNYGNY 

4 92-4 97 GGRMGF 
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[5] PDOC00009 PS00009 AMI DAT ION 
Amidation site 

582-585 PGRR 

[6] PDOC00018 PS00018 EFJiAND 
EF-hand calcium-binding domain 

Number of matches: 2 

1 439-451 DVDGDGHISQEEF 

2 468-480 DQNQDGC I SREEM 

[7] PDOC00379 PS00479 DAG_PE_BIND_DOM__l 
Phorbol esters / diacylglycerol binding domain 

4 99-548 HN FQE SN S LRPVACRHCKAL I LG I YKQGLKCRACGVNCHKQCKDRLS VEC 



Membrane spanning structure and domains: 

Helix Begin End Score Certainty 

1 34 54 0.713 Putative 

2 195 215 0.653 Putative 

3 238 258 0.788 Putative 
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BLAST Alignment to Top Hit: 

>CRA| 18000005086608 /altid=gi I 5031623 /def =ref I NP_005816 . 1 1 RAS guanyl 
releasing protein 2 (calcium and DAG-regulated) ; calcium 
and diacylglycerol-regulated guanine nucleotide exchange 
factor I [Homo sapiens] /org=Homo sapiens /taxon=9606 
/dataset=nraa /length=609 
Length - 609 
Score - 1241 bits (3176), Expect = 0.0 
Identities = 608/615 (98%), Positives = 609/615 (98%) 
Frame — +3 

Query: 228 MAGTLDLDKGCTVEELLRGCIEAFDDSGKVRDPQLVRIFLMMHPWYIPSSQLAAKLLHIY 407 

MAGTLDLDKGCTVEELLRGCIEAFDDSGKVRDPQLVR+FLMMHPWYIPSSQLAAKLLHIY 
Sbjct: 1 MAGTLDLDKGCTVEELLRGCIEAFDDSGKVRDPQLVRMFLMMHPWYIPSSQLAAKLLHIY 60 

Query: 408 QQSRKDNSNSLQVKTCHLVRYWISAFPAEFDLNPELAEQIKELKALLDQEGNRRHSSLID 587 

QQSRKDNSNSLQVKTCHLVRYWISAFPAEFDLNPELAEQIKELKALLDQEGNRRHSSLID 
Sbjct: 61 QQSRKDNSNSLQVKTCHLVRYWISAFPAEFDLNPELAEQIKELKALLDQEGNRRHSSLID 120 

Query: 588 IDSVPTYKWKRQVTQRNPVGQKKRKMSLLFDHLEPMELAEHLTYLEYRSFCKILFQDYHS 7 67 

IDSVPTYKWKRQVTQRNPVGQKKRKMSLLFDHLEPMELAEHLTYLEYRSFCKILFQDYHS 
Sbjct: 121 IDSVPTYKWKRQVTQRN PVGQKKRKMSLLFDHLE PMELAEHLTYLEYRSFCKILFQDYHS 180 

Query: 768 FVTHGCTVDNPVLERFISLFNSVSQWVQLMILSKPTAPQRALVITHFVHVAEKLLQLQNF 947 

FVTHGCTVDNPVLERFISLFNSVSQWVQLMILSKPTAPQRALVITHFVHVAEKLLQLQNF 
Sbjct: 181 FVTHGCTVDNPVLERFISLFNSVSQWVQLMILSKPTAPQRALVITHFVHVAEKLLQLQNF 240 

Query 94 8 NTLMAWGGLSHSSISRLKETHSHVSPETIKLWEGLTELVTATGNYGNYRRRLAACVGFR 1127 

NTLMAWGGLSHSSISRLKETHSHVSPETIKLWEGLTELVTATGNYGNYRRRLAACVGFR 
Sbjct: 241 NTLMAWGGLSHSSISRLKETHSHVSPETIKLWEGLTELVTATGNYGNYRRRLAACVGFR 300 

Query: 1128 FPILGVHLKDLVALQLALPDWLDPARTRLNGAKMKQLFSILEELAMVTSLRPPVQANPDL 1307 

FPILGVHLKDLVALQLALPDWLDPARTRLNGAKMKQLFSILEELAMVTSLRPPVQANPDL 
Sbjct: 301 FPILGVHLKDLVALQLALPDWLDPARTRLNGAKMKQLFSILEELAMVTSLRPPVQANPDL 360 

Query: 1308 LSLLTVSLDQYQTEDELYQLSLQREPRSKSSPTSPTSCTPPPRPPVLEEWTSAAKPKLDQ 14 87 

LSLLTVSLDQYQTEDELYQLSLQREPRSKSSPTSPTSCTPPPRPPVLEEWTSAAKPKLDQ 
Sbjct: 361 LSLLTVSLDQYQTEDELYQLSLQREPRSKSSPTSPTSCTPPPRPPVLEEWTSAAKPKLDQ 420 

Query: 1488 ALWEHIEKMVESVFRNFDVDGDGHISQEEFQIIRGNFPYLSAFGDLDQNQDGCISREEM 1667 

ALVVEHIEKMVESVFRNFDVDGDGHISQEEFQIIRGNFPYLSAFGDLDQNQDGCISREEM 
Sbjct: 421 ALWEHIEKMVESVFRNFDVDGDGHISQEEFQIIRGNFPYLSAFGDLDQNQDGCISREEM 480 

Query 1668 VSYFLRSSSVLGGRMGFVHNFQESNSLRPVACRHCKALILGIYKQGLKCRACGVNCHKQC 1847 

VSYFLRSSSVLGGRMGFVHNFQESNSLRPVACRHCKALILGIYKQGLKCRACGVNCHKQC 
Sbjct: 481 VSYFLRSSSVLGGRMGFVHNFQESNSLRPVACRHCKALILGIYKQGLKCRACGVNCHKQC 540 

Query: 1848 KDRLSVECRRRAQSVSLEGSAPSPSPMHSHHHRAFSFSLPRPGRRGSRPPAIPLPAEIRE 2027 

KDRLSVECRRRAQSVSLEGSAPSPSPMHSHHHRAFSFSLPRPGRRGSRPP EIRE 
Sbjct: 541 KDRLSVECRRRAQSVSLEGSAPSPSPMHSHHHRAFSFSLPRPGRRGSRPP EIRE 594 

Query: 2028 EEVQTVEDGVFDIHL 2072 

EEVQTVEDGVFDIHL 
Sbjct: 595 EEVQTVEDGVFDIHL 609 (SEQ ID NO: 4) 
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>CRA| 1000682340958 /altid=gi I 6358505 / def =gb I AAF07219 . 1 1 AF043722_1 
(AF04 3722) guanine exchange factor MCG7 isoform 1 [Homo 
sapiens] /org=Homo sapiens /taxon=9606 /dataset=nraa 
/length=671 
Length = 671 
Score = 1293 bits (3309), Expect =0.0 

Identities = 639/670 (95%), Positives = 643/670 (95%), Gaps = 4/670 (0%) 
Frame = +3 

Query: 75 GRGGVKLPQGPPRAGREGAPGGGGAAG GVRSEPGGRLPERSLGPAHPAPAAMAGTL 242 

GRG P + +E G +G GVRSEPGGRLPERSLGPAHPAPAAMAGTL 

Sbjct: 8 GRGTQGWPGSSEQHVQEATSSAGLHSGVDELGVRSEPGGRLPERSLGPAHPAPAAMAGTL 67 

Query: 243 DLDKGCTVEELLRGCIEAFDDSGKVRDPQLVRIFLMMHPWYIPSSQLAAKLLHIYQQSRK 422 

DLDKGCTVEELLRGCIEAFDDSGKVRDPQLVR+FLMMHPWYIPSSQLAAKLLHIYQQSRK 
Sbjct: 68 DLDKGCTVEELLRGCIEAFDDSGKVRDPQLVRMFLMMHPWYIPSSQLAAKLLHIYQQSRK 127 

Query: 423 DNSNSLQVKTCHLVRYWISAFPAEFDLNPELAEQIKELKALLDQEGNRRHSSLIDIDSVP 602 

DNSNSLQVKTCHLVRYWISAFPAEFDLNPELAEQIKELKALLDQEGNRRHSSLIDIDSVP 
Sbjct: 128 DNSNSLQVKTCHLVRYWISAFPAEFDLNPELAEQIKELKALLDQEGNRRHSSLIDIDSVP 187 

Query: 603 TYKWKRQVTQRNPVGQKKRKMSLLFDHLEPMELAEHLTYLEYRSFCKILFQDYHSFVTHG 7 82 

TYKWKRQVTQRNPVGQKKRKMSLLFDHLEPMELAEHLTYLEYRSFCKILFQDYHSFVTHG 
Sbjct: 188 TYKWKRQVTQRN PVGQKKRKMSLLFDHLEPMELAEHLTYLEYRSFCKILFQDYHSFVTHG 247 

Query: 783 CTVDNPVLERFISLFNSVSQWVQLMILSKPTAPQRALVITHFVHVAEKLLQLQNFNTLMA 962 

CTVDNPVLERFISLFNSVSQWVQLMILSKPTAPQRALVITHFVHVAEKLLQLQNFNTLMA 
Sbjct: 248 CTVDN PVLERFISLFNSVSQWVQLMILSKPTAPQRALVITHFVHVAEKLLQLQNFNTLMA 307 

Query: 963 VVGGLSHS S I SRLKETHSHVS PET IKLWEGLTELVTATGNYGN YRRRLAACVGFRFPI LG 1142 

VVGGLSHS S I SRLKETHSHVS PET IKLWEGLTELVTATGN YGNYRRRLAACVGFRFPI LG 
Sbjct: 308 VVGGLSHSS I SRLKETHSHVS PET IKLWEGLTELVTATGNYGNYRRRLAACVGFRFPILG 367 

Query: 1143 VHLKDLVALQLALPDWLDPARTRLNGAKMKQLFSILEELAMVTSLRPPVQANPDLLSLLT 1322 

VHLKDLVALQLALPDWLDPARTRLNGAKMKQLFSILEELAMVTSLRPPVQANPDLLSLLT 
Sbjct: 368 VHLKDLVALQLALPDWLDPARTRLNGAKMKQLFSILEELAMVTSLRPPVQANPDLLSLLT 427 

Query: 1323 VSLDQYQTEDELYQLSLQREPRSKSSPTSPTSCTPPPRPPVLEEWTSAAKPKLDQALVVE 1502 

VSLDQYQTEDELYQLSLQRE PRSKSSPTSPTSCTPPPRPPVLEEWTSAAKPKLDQALVVE 
Sbjct: 428 VSLDQYQTEDELYQLSLQREPRSKSSPTSPTSCTPPPRPPVLEEWTSAAKPKLDQALVVE 487 

Query: 1503 HIEKMVESVFRNFDVDGDGHISQEEFQIIRGNFPYLSAFGDLDQNQDGCISREEMVSYFL 1682 

HIEKMVESVFRNFDVDGDGHISQEEFQIIRGNFPYLSAFGDLDQNQDGCISREEMVSYFL 
Sbjct: 488 HIEKMVESVFRNFDVDGDGHISQEEFQIIRGNFPYLSAFGDLDQNQDGCISREEMVSYFL 547 

Query: 1683 RSSSVLGGRMGFVHN FQESNSLRPVACRHCKALILGIYKQGLKCRACGVNCHKQCKDRLS 1862 

RS S SVLGGRMG FVHN FQE SN S LRP VACRHCKAL I LG I YKQG LKCRACGVNCHKQCKDRL S 
Sbjct: 548 RSS SVLGGRMG FVHN FQE SN S LR P VACRHCKAL I LG I YKQGLKCRACGVNCHKQCKDRLS 607 

Query: 1863 VECRRRAQSVSLEGSAPSPSPMHSHHHRAFSFSLPRPGRRGSRPPAIPLPAEIREEEVQT 2042 

VECRRRAQSVSLEGSAPSPSPMHSHHHRAFSFSLPRPGRRGSRPP EIREEEVQT 
Sbjct : 608 VECRRRAQSVSLEGSAPSPSPMHSHHHRAFSFSLPRPGRRGSRPP EIREEEVQT 661 

Query: 2043 VEDGVFDIHL 2072 
VEDGVFDIHL 

Sbjct: 662 VEDGVFDIHL 671 (SEQ ID NO:5) 
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>CRA| 18000005188697 /altid=gi I 6755290 /def =ref I NP_035372 . 1 I RAS, 
guanyl releasing protein 2; RAP 1A protein-specific 
guanine nucleotide exchange factor 1; CalDAG-GEFI [Mus 
musculus] /org=Mus musculus /taxon=10090 /dataset=nraa 
/length=608 
Length = 608 
Score = 1202 bits (3076), Expect = 0.0 
Identities = 589/615 (95%), Positives - 597/615 (96%) 
Frame = +3 

Query: 228 MAGTLDLDKGCTVEELLRGCIEAFDDSGKVRDPQLVRIFLMMHPWYIPSSQLAAKLLHIY 407 

MA TLDLDKGCTVEELLRGCIEAFDDSGKVRDPQLVR+FLMMHPWYIPSSQLA+KLLH Y 
Sbjct: 1 MASTLDLDKGCTVEELLRGCIEAFDDSGKVRDPQLVRMFLMMHPWYIPSSQLASKLLHFY 60 

Query: 408 QQSRKDNSNSLQVKTCHLVRYWISAFPAEFDLNPELAEQIKELKALLDQEGNRRHSSLID 587 

QQSRKDNSNSLQVKTCHLVRYW+SAFPAEFDLNPELAE IKELKALLDQEGNRRHSSLID 
Sbjct: 61 QQSRKDNSNSLQVKTCHLVRYWVSAFPAEFDLNPELAEPIKELKALLDQEGNRRHSSLID 120 

Query: 588 IDSVPTYKWKRQVTQRN PVGQKKRKMSLLFDHLE PMELAEHLTYLEYRSFCKILFQDYHS 7 67 

I+SVPTYKWKRQVTQRNPV QKKRKMSLLFDHLEPMELAEHLTYLEYRSFCKILFQDYHS 
Sbjct: 121 IESVPTYKWKRQVTQRNPVEQKKRKMSLLFDHLE PMELAEHLTYLEYRSFCKILFQDYHS 180 

Query: 768 FVTHGCTVDNPVLERFISLFNSVSQWVQLMILSKPTAPQRALVITHFVHVAEKLLQLQNF 947 

FVTHGCTVDNPVLERFISLFNSVSQWVQLMILSKPTA QRALVITHFVHVAEKLLQLQNF 
Sbjct: 181 FVTHGCTVDN PVLERFISLFNSVSQWVQLMILSKPTATQRALVITHFVHVAEKLLQLQNF 240 

Query: 94 8 NTLMAVVGGLSHSSISRLKETHSHVSPETIKLWEGLTELVTATGNYGNYRRRLAACVGFR 1127 

NT LMAVVGGLSHS S I SRLKETHSHVS P+T I KLWEGLTELVTATGN Y N YRRRLAACVG FR 
Sbjct: 241 NTLMAWGGLSHS SI SRLKETHSHVS PDTIKLWEGLTELVTATGNYSNYRRRLAACVGFR 300 

Query: 1128 FPILGVHLKDLVALQLALPDWLDPARTRLNGAKMKQLFSILEELAMVTSLRPPVQANPDL 1307 

FPILGVHLKDLVALQLALPDWLDP RTRLNG AKM+QL FS I LEE LAMVT S LRP PVQAN P DL 
Sbjct: 301 FPILGVHLKDLVALQLALPDWLDPGRTRLNGAKMRQLFSILEELAMVTSLRPPVQANPDL 3 60 

Query: 1308 LSLLTVSLDQYQTEDELYQLSLQREPRSKSSPTSPTSCTPPPRPPVLEEWTSAAKPKLDQ 14 87 

LSLLTVSLDQYQTEDELYQLSLQREPRSKSSPTSPTSCTPPPRPPVLEEWTS AKPKLDQ 
Sbjct: 361 LSLLTVSLDQYQTEDELYQLSLQRE PRSKSSPTSPTSCTPPPRPPVLEEWTSVAKPKLDQ 420 

Query: 1488 ALVVEHIEKMVESVFRNFDVDGDGHISQEEFQIIRGNFPYLSAFGDLDQNQDGCISREEM 1667 

ALV EHIEKMVESVFRNFDVDGDGHISQEEFQIIRGNFPYLSAFGDLDQNQDGCISREEM 
Sbjct: 421 ALVAEHIEKMVESVFRNFDVDGDGHISQEEFQIIRGNFPYLSAFGDLDQNQDGCISREEM 480 

Query: 1668 VSYFLRSSSVLGGRMGFVHNFQESNSLRPVACRHCKALILGIYKQGLKCRACGVNCHKQC 1847 

+SYFLRSSSVLGGRMGFVHNFQESNSLRPVACRHCKALILGIYKQGLKCRACGVNCHKQC 
Sbjct: 481 ISYFLRSSSVLGGRMGFVHNFQESNSLRPVACRHCKALILGIYKQGLKCRACGVNCHKQC 540 

Query: 1848 KDRLSVECRRRAQSVSLEGSAPSPSPMHSHHHRAFSFSLPRPGRRGSRPPAIPLPAEIRE 2027 

KDRLSVECRRRAQSVSLEGSAPSPSP H+ HHRAFSFSLPRPGRR SRPP EIRE 
Sbjct: 541 KDRLSVECRRRAQSVSLEGSAPSPSPTHT-HHRAFSFSLPRPGRRSSRPP EIRE 593 

Query: 2028 EEVQTVEDGVFDIHL 2072 

EEVQTVEDGVFDIHL 
Sbjct: 594 EEVQTVEDGVFDIHL 608 (SEQ ID NO: 6) 
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>CRA! 18000005205935 /altid=gi I 7662334 /def =ref I NP_0561 91 . 1 1 KIAA0846 
protein [Homo sapiens] /org=Homo sapiens /taxon=9606 
/dataset=nraa /length=689 
Length = 689 
Score = 618 bits (1576), Expect = e-175 

Identities = 314/597 (52%), Positives - 409/597 (67%), Gaps = 5/597 (0%) 
Frame = +3 

Query: 234 GTLDLDKGCTVEELLRGCIEAFDDSGKVRDPQLVRIFLMMHPWYIPSSQLAAKLLHIYQQ 413 

G+ L K T++ELL CIE FDD+G++ + L RI L+MH WY+ S++LA KLL +Y+ 
Sbjct: 2 GSSGLGKAATLDELLCTCIEMFDDNGELDNSYLPRIVLLMHRWYLSSTELAEKLLCMYRN 61 

Query: 414 SRKDNSNSLQVKTCHLVRYWISAFPAEFDLNPELAEQIKELKALLDQEGNRRHSSLIDID 593 

+ ++ N ++K C+ +RYWI FPAEF+L+ L +E + + Q G +H SLIDI 
Sbjct: 62 ATGESCNE FRLKI CYFMRYW I LKFPAE FNLDLGLIRMTEEFREVASQLGYEKHVSL I D I S 121 

Query: 594 SVPTYKWKRQVTQRNPVGQKKRKMSLLFDHLEPMELAEHLTYLEYRSFCKILFQDYHSFV 773 

S+p+Y W R+VTQR V KK K LLFDHLEP+ELAEHLT+LE++SF +1 F DY S+V 
Sbjct: 122 SIPSYDWMRRVTQRKKVS-KKGKACLLFDHLEPIELAEHLTFLEHKSFRRISFTDYQSYV 180 

Query: 77 4 THGCTVDNPVLERFISLFNSVSQWVQLMILSKPTAPQRALVITHFVHVAEKLLQLQNFNT 953 

HGC +NP LER I+LFN +S+WVQLM+LSKPT QRA VIT F++VA+KLLQL+NFNT 
Sbjct: 181 IHGCLENNPTLERSIALFNGISKWVQLMVLSKPTPQQRAEVITKFINVAKKLLQLKNFNT 240 

Query: 954 LMAWGGLSHSSISRLKETHSHVSPETIKLWEGLTELVTATGNYGNYRRRLAACVGFRFP 1133 

LMAVVGGLSHSSISRLKETHSH+S E K W +TELV++ GNY NYR+ A C GF+ P 
Sbjct: 241 LMAVVGGLSHSSISRLKETHSHLSSEVTKNWNEMTELVSSNGNYCNYRKAFADCDGFKIP 300 

Query: 1134 ILGVHLKDLVALQLALPDWLDPARTRLNGAKMKQLFSILEELAMVTSLRPPVQANPDLLS 1313 

ILGVHLKDL+A+ + PDW + ++N KM QL L EL + + ++ N DL++ 
Sbjct: 301 ILGVHLKDLIAVHVIFPDWTE — ENKVNIVKMHQLSVTLSELVSLQNASHHLEPNMDLIN 358 

Query: 1314 LLTVSLDQYQTEDELYQLSLQREPRSKSSPTSPTSCTPPPRPPVLEEWTSAAKPKLDQAL 14 93 

LLT+SLD Y TED++Y+LSL EPR+ SPTSPT+ P +P V EW PK D + 

Sbjct: 359 LLTLSLDLYHTEDDIYKLSLVLEPRNSKSPTSPTT PNKPVVPLEWALGVMPKPDPTV 415 

Query: 1494 VVEHIEKMVESVFRNFDVDGDGHISQEEFQIIRGNFPYLSAFGDLDQNQDGCISREEMVS 1673 

+ +HI K+VESVFRN+D D DG+ISQE+F+ I NFP+L +F LD++QDG IS++EM++ 
Sbjct: 416 INKHIRKLVESVFRNYDHDHDGYI SQEDFES IAANFPFLDSFCVLDKDQDGLI SKDEMMA 475 

Query: 1674 YFLRSSSVLGGRM— GFVHNFQESNSLRPVACRHCKALILGIYKQGLKCRACGVNCHKQC 1847 

YFLR+ S L +M GF+HNFQE L+P C HC + GI KQG KC+ CG NCHKQC 
Sbjct: 476 YFLRAKSQLHCKMGPGFIHNFQEMTYLKPTFCEHCAGFLWGIIKQGYKCKDCGANCHKQC 535 

Query: 1848 KDRLSVECRRRAQSVSL EGSAPSPSPMHSHHHRAFSFSLPRPGRRGSRPPAIPL 2009 

KD L + CRR A++ SL GS P + F F G R AI L 

Sbjct: 536 KDLLVLACRRFARAPSLSSGHGSLPGSPSLPPAQDEVFEFPGVTAGHRDLDSRAITL 592 (SEQ 
ID NO:7) 



>CRA|18000005188699 /altid-gi I 3928857 /def =gb |AAC7 9700 . 1 1 (AF081196) 
calcium and DAG-regulated guanine nucleotide exchange 
factor II [Rattus norvegicus] /org=Rattus norvegicus 
/taxon=10116 /dataset=nraa /length=795 
Length =7 95 
Score = 533 bits (1358), Expect = e-150 

Identities = 267/590 (45%), Positives = 390/590 (65%), Gaps - 12/590 (2%) 
Frame = +3 

Query: 156 GVRSEPGGRLPERSLG PAHPAPAAMAGTLD LDKGCTVEELLRGCIEAFDDS 308 

G R+ P GRL +S PA ++A L KG ++++L+ CI++FD 

Sbjct: 17 GSRAGPKGRLEAKSTNSPLPAQPSLAQITQFRMMVSLGHLAKGASLDDLIDSCIQSFDAD 76 
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Query: 309 GKV-RDPQLVRIFLMMHPWYIPSSQLAAKLLHIYQQSRKDNSNSLQVKTCHLVRYWISAF 485 

G + R QL+++ L MH I S++L KL+++Y+ + + NS + +K C+ VRYWI+ F 
Sbjct: 77 GNLCRSNQLLQVMLTMHRI I ISSAELLQKLMNLYKDALEKNSPGICLKICYFVRYWITEF 136 

Query: 486 PAEFDLNPELAEQIKELKALLDQEGNRRHSSLIDIDSVPTYKWKRQVTQRNPVG-QKKRK 662 

F ++ L ++E + L+ G H LID + + W R++TQR KKRK 
Sbjct: 137 WIMFKMDASLTSTMEEFQDLVKANGEESHCHLIDTTQINSRDWSRKLTQRIKSNTSKKRK 196 

Query: 663 MSLLFDHLEPMELAEHLTYLEYRSFCKILFQDYHSFVTHGCTVDNPVLERFISLFNSVSQ 842 

+SLLFDHLEP EL+EHLTYLE++SF +1 F DY +++ + C +NP +ER I+L N +SQ 
Sbjct: 197 VSLLFDHLEPEELSEHLTYLEFKSFRRISFSDYQNYLVNSCVKENPTMERSIALCNGISQ 256 

Query: 843 WVQLMILSKPTAPQRALVITHFVHVAEKLLQLQNFNTLMAWGGLSHSSISRLKETHSHV 1022 

WVQLM+LS+PT RA V F+HVA+KL QLQNFNTLMAV+GGL HSSISRLKET SHV 
Sbjct: 257 WVQLMVLSRPTPQLRAEVFIKFIHVAQKLHQLQNFNTLMAVIGGLCHSSISRLKETSSHV 316 

Query: 1023 SPETIKLWEGLTELVTATGNYGNYRRRLAACVGFRFPILGVHLKDLVALQLALPDWLDPA 1202 

E K+ +TEL+++ NY NYRR C F+ P I LGVHLKDL+ + L A+PD+L+ 
Sbjct: 317 PHEINKVLGEMTELLSSCRNYDNYRRAYGECTHFKIPILGVHLKDLISLYEAMPDYLEDG 376 

Query: 1203 RTRLNGAKMKQLFSILEELAMVTSLRPPVQANPDLLSLLTVSLDQYQTEDELYQLSLQRE 1382 

++N K+ L++ + EL + + PP+ AN DL+ LLT+SLD Y TEDE+Y+LS RE 
Sbjct: 377 --KVNVQKLLALYNHINELVQLQDVAPPLDANKDLVHLLTLSLDLYYTEDEIYELSYARE 434 

Query: 1383 PRSKSSPTSPTSCTPPPRPPVLEEWTSAAKPKLDQALVVEHIEKMVESVFRNFDVDGDGH 1562 

PR+ +p p +PPV+ +W S PK D + +H+++MV+SVF+N+D+D DG+ 

Sbjct: 435 PRNHRAPP LTPSKPPWVDWASGVSPKPDPKTISKHVQRMVDSVFKNYDLDQDGY 489 

Query: 1563 ISQEEFQIIRGNFPYLSAFGDLDQNQDGCISREEMVSYFLRSSSVLGG-RMGFVHNFQES 1739 

ISQEEF+ I +FP+ +F +D++++G ISR+E+ +YF+R+SS+ +GF HNFQE+ 

Sbjct: 4 90 ISQEEFEKIAASFPF— SFCVMDKDREGLISRDEITAYFMRASSIYSKLGLGFPHNFQET 547 

Query: 1740 NSLRPVACRHCKALILGIYKQGLKCRACGVNCHKQCKDRLSVECRRRAQS 1889 

L+P C +C + G+ KQG +C+ CG+NCHKQCKD + EC++R++S 
Sbjct: 548 TYLKPTFCDNCAGFLWGVIKQGYRCKDCGMNCHKQCKDLVVFECKKRSKS 597 (SEQ ID 
NO:8) 



Hmmer search results (Pfam) 

Model Description 



Score 



E-value N 



PF00617 RasGEF domain 123.5 4e-33 1 

PF00130 Phorbol esters/diacylglycerol binding domain 59.5 3.6e-14 1 

PF00036 EF hand 21.8 0.00027 2 

PF01237 Oxysterol-binding protein 3.5 4.2 1 



Parsed for domains: 



Model 


Domain 


seq-f 


seq-t 


hmm-f 


hmm-t 




score 


E-value 


PF01237 


1/1 


249 


272 . 


1 


24 


[. 


3.5 


4.2 


PF00617 


1/1 


148 


336 . 


1 


227 


[] 


123.5 


4e-33 


PF00036 


1/2 


430 


458 . 


1 


29 


□ 


17.4 


0.0047 


PF00036 


2/2 


4 63 


482 . 


5 


24 




6.7 


4.9 


PF00130 


1/1 


499 


548 . 


1 


51 


[] 


59.5 


3.6e-14 



FIGURE 2, page 7 of 7 




1 ACAGAAAGGT CCTGTTTCTA AGTCTTACAT TACCAAGACT GAGGTGCGGG 
51 GGCGGTCCTG GATCCCCCGC CCCAAGGCTG GGAGGGGCAC GCCTCGGAAG 
101 GGAGGTTTGG GGTCGGTGGT TTCACAGTGA GTGTGTCTGA AGCCAAATGG 
151 TCGGAAACCG TTACCCGCTC TCCTAGGCCC GGCTAGTGGG GACCCCAACC 
201 GCCTGCGGCT GCCCCTCCCA AGTTCCTCCC TGTTGGCCAG GCATCCAGGT 
251 CTCCAGTCTC CGAGCTGCGG AGAACCCACC GCCACATGCG GCTGCCCCTT 
301 TCCATTCGAC CCTGTGGGGA GCCAGGCTTC CGGGGCCCCG TTCCTCCTGT 
351 GTGAACTGGG CCCCCCGCCC CCATTCCCAG ACATCAAGGC CGCGTCTCCA 
401 GATAGCCACG ATTTCATTCC TCGCTCCCCA CAGGTCCCTC TCCCCAAAAT 
4 51 ATTCCCATCT TGTCCTAGCC CATCCCCCAG ACTATCTCAA GGACCAGCTG 
501 TCCCCACGCC CCCGACCTCC ACTAGGCCTG TGCCACCCGC TGCCTGCAGG 
551 AAGACGCCCG GTCCCGGGCC GGGTTAGCCC CATGGGAACG GTTTGTCTCG 
601 AAAACAGGAA CCCGAGCTGG GGGCTGGGCG GGGCGCCCCT TCCCCACCGC 
651 AGTCCGCTTC CTGCCCCTCC CGGCTTCCTC CGCCCGACAC CCAGGCAGGG 
701 CGGGGGGCAC TGGGGCGTCC GCGGTTGGGG GAGGGGCTCT TCGTTTCGGT 
751 CCCCCCTCCC GCGTCCCGGG CGGCGGGGCC TCCGGTCGCC CGCCTCGGGG 
801 CAGCTAGTGG CGCAGCCCCC CGCCCGCGGC CCTGGCCTCC CGGGCGGCGC 
851 GGCAGGGGAG GGGTTAAGCT GCCGCAGGGA CCGCCGCGTG CGGGGCGAGA 
901 GGGAGCCCCC GGTGGGGGTG GCGCAGCCGG CGGTGCGGAG CTCCGCGCAG 
951 GGGCGGAGGG GGGAGGGGGC AGCCTGGCGC GGGGGCGGGG GCGGGGCGGC 
1001 GGGGAGCGGG GCCGCGGCGT GGAGAGCGGG CGGGAGCCGC AGCCGCAGCG 
1051 AGGCCGGCGG GCGGGAGCGC ACGGAGGTGG GGTCGGCCAG GCCGGTGCGG 
1101 GCTCCTTGCG GCAGGTCCCA AGAGTGAGTG GGCGAGCGCG GGCGGGGCGC 
1151 CAGGCGAAGG AGGGCGCGGC CCCCAGCGAC TCCCCCCCCG CCCAGGGCGG 
1201 CGCGGGCGGG CTGGGGGCGG CGAGCGGGTG GGGAGTCTGC GGCCCGGGTC 
1251 TGGGAGAGGG GGCAGCGGCC ACGAGAGCTA AGGCGCGCTG GATCCCCGGA 
1301 GGGCGGAGGA CCTCCACGGT GCACCCAGCT TTTCCCAGCC ACCTTCCAGC 
1351 GGGGCCCTCC CCCGCGTACC CCCATTTGGC AGATGAGAAA ATTGAGGCTC 
1401 CCAGAGGCCA AGTGATTCTC AAGGTCACAC GAGGAAGCGG TAGAGCCAGG 
1451 CGGGGACGGC TCTGGGTGGC TCTTAGGAAA AGTCCGCCTG AGAACTCCGT 
1501 ACAGGAGCTC CCCTGTCCTC CAGCCTGGGG GAGTGAGTAT GTGTAGGGCC 
1551 GGGGTACCTT TCCGTGGGGC AAGGCTCTGC CAAAATCTGG GAGTGAGGGG 
1601 AGTCAGGGAG CTGGGGCCGC AGGGCGGGCC CTGCACCGCA AATGGGAGGG 
1651 GGGCGACGGA ATGGGCGTGC GCACCCATGG GGGTGTGTGC ATGTGTGTGG 
1701 GAGTGTACAT GCGTGGAGAG GCACTGCCTT GCGTGTGTGC ACACGTGTGA 
1751 GGATGTCAGC GCCTGTGTGG CCGCGGGACT CAAGGCTGGC CTGGCTCAAG 
1801 TGAACAGCAC GTCCAGGAGG CGACCTCGTC CGCGGGTTTG CATTCTGGGG 
1851 TGGACGAGCT GGGTATGTGT GCCTGAGGGT TTCTTCGTGC AGGTGTGCAC 
1901 AGGGTGTGGG TGCCATTGTG TGTGAGAGAC GGAGGATGGG GAGGCCGGTG 
1951 CCTGTGGCCC GGTGCGTGTA AGTGCGGACG CCTGCACCTC CACTTAGGTC 
2001 CCCGGCCTCC GACGACTAAC TTGGGTGTGG AGTGTTTGCC CCTGCCAGGG 
2051 TGCGTATGAC CCCGCCAGTG ACCGGAGTTG CTAATGGTGT CATGCACCCA 
2101 CCGGCCACCC TTGGCGCGAG CGCCCCCCTC TGGACACCCT GCTCCGTGCG 
2151 CGCTCACAGT TCGCCTGTGC GGGGCCGGGG CCAGGGTCAG GAGCCGGGGA 
2201 TAGGGAGGAA GAGGGCCTGT GGACAAGCTG AGCCGGGACC CCTGGGACCT 
2251 TTGCGGAGGT GGCCTGGGAG CGCTCAGTTC CCAGGCTGAG GCTTCCCGCT 
2301 GACGCCTCCT GGCCGCAGCG GGCTCCCCCC GCCCCAGGAA TGTTCCTCTC 
2351 CCATCCAGTC CGCCTCCCCT AGGGCAGGCC CCCTGGGGGC TGCCGCAGCC 
2401 CCGCCTCGCC TTCCTGGGCT CCCGGGAGGG GGCGAGGCGA GCAGGACGCC 
24 51 TGGGTTCTCT CCCCCCACCT CCCATACCAG GGAGAAATTC CTCCGAGGTC 
2501 CCCTCAGGCT CTGGGTTCCC AAAATAACCC TGCGGGGGAA GGGAGGCTGT 
2551 GGAGGGAGGG AAGCGGGAGG GGCGCAGAGC CGAGCTGCGG GGTGCTGCAG 
2601 GTGCCTCTGG GGAGAGGGCG CGAGGAGAAG GCGCCCTGCG GGGGGCTGGG 
2651 CGCCAGCCAG TCCTGGGATC TTGGTTCGTC CCCATCCTCG TGAAGCCCCT 
2701 CGGCCTTCCC GCGACTCCGA GGGTGGGCCG GAAGCCTCTC TGCGGGTCCG 
27 51 TTTCCCAACT GGCGGGTTGC ACCATCCCGG GCCAGACCGT TTAACCCCGG 
2801 GAGTGGCCGC GGGGGACAAC TCCGCCCCTG TCCAGCAGGG GGCGTGCCCG 
2851 CCCCGCCCCG TTTCTGCCCG CGGGGCCGCT CCCCCGCCCG CGACTCCGCA 
2901 GACTCCCGCT CTGCCTCTCC CGGGACAGGG GTTCGGTCCG AGCCCGGTGG 
2951 GAGGCTCCCG GAGCGCAGCC TGGGCCCAGC CCACCCCGCG CCGGCGGCCA 
3001 TGGCAGGCAC CCTGGACCTG GACAAGGGCT GCACGGTGGA GGAGCTGCTC 
3051 CGCGGGTGCA TCGAAGCCTT CGGTGAGTGG CTCGGGAGGG CACACGGAGC 
3101 CTGAGCCTAG CCCCGAGTCT GAGCCCGGGT CCCTGCCTCC CAGGCACAGT 
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3151 CCAGGGCACA GCCCTGACCC GGACCCACCC TGCTCCGCAG CGTGCAGTCT 
3201 CTTTAACGAA AGCCTCCTCC GCAACGCAGG GCAGAGAGAT GCACGCCCTT 
3251 CAGACAGATG AGGTTTCCCT TCTCTAGCCT TCCCCAGCGG CGGCGAAGGG 
3301 AGGGCCGGGT CCCGGACTCT GACACTTGAG GGGCATTATC TGTCTCCCGG 
3351 GGAATCCGGA GGAACTCGCT ATCTCCGGCC TGGGAGCTGT TTCCGGCTAA 
34 01 TGGGGGGCGG CTTATCTGGT GAAGGGGTGC CCCTTCCCCC CAAGCGCTCA 
34 51 GGAAATGACC TCTGGATTCT TGACCCCGGG GAACCCAGGC TCCTTCCGCC 
3501 CCAGCTGGTT CCCCTCCGGA CGATGGGCGG CTCGGGCGCT CCCCTCCTCC 
3551 AGTCCTCAGG GCGTGCCTAT CTCTCGCCCA CCACACCTTT CCTCTCTAAT 
3601 TTGCCTCCTG CTCTCGGAGT CCTGGGCAAG CAGGAGGTGG GCGGGGTCGA 
3651 GCGTGCACCC GAAGGACCGA TACCTGGCGG GTTGCGGGGT GAGGATGAGG 
3701 CATGGTAGCT GCGGACCCAG CTCAGCCACC TGTCTTTGAC CCTTCGGAGT 
3751 CAGATGACTC CGGGAAGGTG CGGGACCCGC AGCTGGTGCG CATGTTCCTC 
3801 ATGATGCACC CCTGGTACAT CCCCTCCTCT CAGCTGGCGG CCAAGCTGCT 
3851 CCACATATAT CCTTCGCCGG CCTTGCCAAG GCCCCCGCCG TCGGAGCCCA 
3901 TGCGCAGCCC CTCTGCCCAG CCCAGGTGCA GAATGAGCCT CGCTCCTAAG 
3951 TATAGGCCAC TCCTTATCCC AGAGCTCAGG CGTCGTCCCA GCCTCCAACT 
4001 AGGGCCTAGG CTCTGCCCCC TCCTTGCTCC TAGCGACTCG GTCCTGTCCC 
4 051 CAGGCTCTGT CCCCAGCCGA GGCCTTGCCC TCCTTCTCCC TAGAGTCTAG 
4101 GGCCTGCCCC TGCTTCAGGC TTGGGTGCGC CCCGTGCATC TCTCTCTCCC 
4151 AGAGCCCAGG CTTTGCTTTC AGCCTCCCTC AGCACCTAGT CCTCCACCCC 
4201 CACCTCCAAC CCCTCCCAGA GCTCAAGCCT CACCCCCAGC ATCTCCGCAG 
4251 AGCGCAAGCC CCATCCCTAG AACGTGTCTC CTAGAACCAG GCCCCGCCCC 
4301 CAGCCTCCCT CCACGCAGGC CTCCCTTTCT AGAGTTAAGC GGCCTCCTTA 
4351 ACCCTCTCCT TCACCTACCA ACAATCCCGG AAGGACAACT CCAATTCCCT 
4401 GCAGGTGAAA ACGTGCCACC TGGTCAGGTG AGTCTTTCCC CTGGGGCTCT 
4451 AGCCCCTCCC CTTTCTCCCT TCTCTCTGGC TTCAGGCTGG CCTGGAGGAG 
4501 GGGGCAGGGC GCTGTTTCTG GGAGTGGGTT TGAACCCTGG CTTGTCCGGG 
4551 TGGGCAGTGC TGCCACAGGC TCACCCCTTC CTGGGTCTGG GCCTTAATTT 
4 601 TCTTTTCTGC GCAGTGCGGG TGGTTGTCTC AAGGGTCTAA TGTACACTTG 
4 651 GAGTGGCGAA GGAAAGAGCT GGAACCATAG TTTGAGGGTC TTTTTGCTTA 
4701 GGTGACTATA ATCTCAAATA GCTCCTTGCA GCCTGCTGGG TGATGGTGGG 
47 51 GGAAGGGCTA TCTTGGGTGA CTCCCCGCTC CTCCAGGTAC TGGATCTCCG 
4801 CCTTCCCAGC GGAGTTTGAC TTGAACCCGG AGTTGGCTGA GCAGATCAAG 
4851 GAGCTGAAGG CTCTGCTAGA CCAAGAAGGG AACCGACGGC ACAGCAGCCT 
4 901 AATCGACATA GACAGCGTGT GCGTGGGGGG AGCACAGAGG GCTGGGGGGG 
4 951 CACTCAGTAT CCTATACCAT CTGTGCTTAA TAAATGTCTG TTGAACTGAA 
5001 TGAGTGAGGG TCATGTTGCT CTCTCGCTTA AAAACCTTCC ATGGCTCCCT 
5051 ATTGCCTTCA ACATGCCTCC TCTGGGCAGC TTGGCGTTCC TGCCTCATCT 
5101 TCCACTGCCA CCACCCATCC CACACACCTC CTCCTGTAGC TGCGCTGGGT 
5151 CGGCTCCCCG TCGGCTGAGC TCTCGAGTCC TTTCTCATCA TGGTGCTCTG 
5201 CT CAT AT CAT CCCCCTTGCT GCCTCCTCCG TGTTACCAAG ACTCAGTTCA 
5251 GGCATGAAGT CTCCGTGGGC TCTGAGGGTT CGGGGCTCTT CCGGGGTAGA 
5301 ATTTGTCGTT CCCACCTCTG TTTTCCATGG CACTTTGTAC AGACTCCTGT 
5351 ACAAAGACCT CTGTACATGT GTCACGCTGT TTTGTGATCA TGTGTTTCTG 
54 01 TGTCTGTCTC CCTCAGTAGA CTGTGAGCTC CTCGAGGGCA GGAACCGTGT 
54 51 CTTACTCATC TCTGTATTCC CAGCGCCTAG CACAGTGCCT GGCACAGAGT 
5501 ACGTTGTTCA TAAATGTGTG TTGAGTGCAT GACGGGGTGG GGGGAGATGA 
5551 GGAGGAGTTG CTGGGACTGG GAACATTCGT GCCTAGGACA GTGCCTCGCA 
5601 TTATGTAGGT TCTCAGTAAG CGTGAATGGT GTGTCTGTGT GAGTGGGGGG 
5651 CCACGAGGCA TGCGCATGTC CAGCAAAGGG CTCACTACCC CTGCCCCCCC 
57 01 AGCCCTACCT ACAAGTGGAA GCGGCAGGTG ACTCAGCGGA ACCCTGTGGG 
57 51 ACAGAAAAAG CGCAAGATGT CCCTGTTGTT TGACCACCTG GAGCCC AT GG 
5801 AGCTGGCGGA GCATCTCACC TACTTGGAGT ATCGCTCCTT CTGCAAGATC 
5851 CTGGTGCGGC CCGAGGGCTG GGGGGTCAGG GGTCCAATGT GGGCTGGAAG 
5901 AGAGTTCTAG GAGGGGCAGG GTCCCTGGCG TAGGCTGGGT CACAGGGTGC 
5951 ATCAGGGGTT TCAGTGTAAC CACTGAAGGT CAGCTGGAGG GTGAGGAGTG 
6001 GCTATCAGTG AGGGGAGAGG CCGGCAAGGT GCTGAGGCCA CTCCTCATGC 
6051 CCCCAGTTTC AGGACTATCA CAGTTTCGTG ACTCATGGCT GCACTGTGGA 
6101 CAACCCCGTC CTGGAGCGGT TCATCTCCCT CTTCAACAGC GTCTCACAGT 
6151 GGGTGCAGCT CATGATCCTC AGCAAACCCA CAGCCCCGCA GCGGGCCCTG 
6201 GT CATC AC AC ACTTTGTCCA CGTGGCGGAG GTGCCTGCCC CTCCCTCCCG 
6251 GTGTCTCCCA ACCACCCCAC ATGCCAGTCA GGCCAACCCT TCCCTTCCCC 
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6301 TAACCCACTG CCTTCTCTCT AGATAAGCTG GGCCAAATTC TGGGCCCACT 
6351 CAGTGACTCC CTGCCTCTCC GTCCCCATTT GCCTTCCAGA AGCTGCTACA 
6401 GCTGCAGAAC TTCAACACGC TGATGGCAGT GGTCGGGGGC CTGAGCCACA 
6451 GCTCCATCTC CCGCCTCAAG GAGACCCACA GCCACGTTAG CCCTGAGACC 
6501 ATCAAGGTGC CTGGGACTGG GGAGGGGCCG GTGCTTCCCA GGTCTGTCTT 
6551 CACTGGGTCC TCCCAGCAGC ACTGGGGGCT GGGCACAGCT GTCCTCATTT 
6601 GATAGATATG GAAATGGAGG CTCAGAGGGG TTAAGTGCTT TTCTCAGTTT 
6651 GCACAATGGC AACAGCAGAG TGGGGGCTCA CAGGTCGTCA GGGACCCCAA 
6701 AGCTAGTACT TTTTTTTTTT TTTTTAAGAC AGGGTCTCTC TCTCTGTTGT 
6751 CCAGACTGGA GTTCAGTGGT GCAGTCACAA GCTCACTGCA GCCTTGAATT 
6801 CCTGAGCTCA ATCGATCCTC CCACCTCAGC CTCCTGAGTA GCTGGGACTA 
6851 CAGGTGTACG CCACCATGCC TAATTTTTGT ATTGTTATTA ATTTTTTTTT 
6901 TTTTTTTTTA GAGATGGGGT TTTGCCATGT TGCCCAGACT GGTCTTGAAC 
6951 TCCTGGGCTC AAGTGATCCG CCTGCCTTGG CCTCCCAAAG TGCTGAGATT 
7001 ATGGCTTGAG CCATTGTGCC TTGCCACTTG TAGTTTCTTC TTTTCTTTCT 
7051 CCTTCATTTT TTATTATTTT TGAAGTATTT TGAAGTATTG AGTAACATAC 
7101 ATATAGAAAA GTATATAAAA ACATATGAGA CTGGGCGTAG TAGCTCACAC 
7151 CTGTAATCCC AGCACTTTGG GAGGCTGAGG TGGGCAGATC ACGTGACATC 
7 201 AGGAGTTTGA GACCAGCCTG GCCAACAAGG TGGAAACCCA TCTCTACTAA 
7251 AATACAAAAA TTAGCCAGGC ATGGTGGCAC GCACCTGGAA TCCAAGCTAC 
7301 TTGGGAGGCT GAGGCAGGAG GAGAATTACT TGAACTCAGG AGGCGGAGGT 
7 351 TGCAGTGAGC CAAGATTGTG CCACTTCACT CCAGCCTGGG CGACAGAGTG 
7401 AGACTCCATC TAAAAAAAAA GAAAAGTATA TAAAAACATA TGAATAGTTT 
7451 AAAGAAAAAT TGTAAAGAAA ACACTGTGTA ACTACTGCCC GGGTTGGGAA 
7 501 ATAGAACCTT GCCAGGCCCC CAAGCGCCCA GCACTTTAGA GCATAACTCC 
7 551 CTCCCCACGA CTTTTGCAAT GATGATCTTG CTTTTCTTTA TAGCTTCACC 
7 601 ATGTAGGTAT GCGGTCCAAA ACAATGTGGG GCTTTTTGTT GTCTGTTTTG 
7 651 AACTTTCTAT GAATGGAATG TTGTTTGTGT TATTTTATGT CTTGCTTTTT 
7701 TCATTCCACA TGGTTCTGAG AGTCTTTTCA TTCTGTCATG TGGAGCAATT 
7751 GTTTTTTCAT TTTCATTGCC ATATAATATT TTATTGTACG TCTACCCCAA 
7801 TTCATTTATT TATTTATTTT TTTGAGATGG AGTCTGTCTC TGTCATCCAG 
7851 GCTGGAGTGC GGTGGCGAGA TCT CATC ACT GCAACTTCCG TCTCCTGGGT 
7 901 TTACGTGATT CTCGTGCCTC AGCCTCCTGA GTAGCTGGGA TTATGGGCTC 

7 951 GTACCACCAC GTCTGGCTAA TTTTTTGTAG AGACAGGCTT TCACCATGTT 
8001 GCCGAGGCTG GTCTTGAACT CCTGAGCTCA GGCAATCCAC CCGCTTTAGC 
8051 CTCCCAAAGT GCTGGGATTA CAGGTGTGAG CCACTGCCCC CAGCCTACCC 
8101 CAATTTATGT ATT GAT TCT A TTGTTGAATG TTGGGGTTTT TCCTTTTCTT 
8151 TTCTTTCTTT CTTTTTCTTT CTTTTTTTCT TTTTTTTGGA GAGGGAGTCT 
8201 TGCTCTGTCG CCAGGCTGGA GTGCAGTGAC GCTAATTTGG CTCACTGCAT 
8251 CACTGCACCC TCTGCCTCCC GGGTTCAAGC GATTCTCCTG CCTCAGCCTC 
8301 CTGAGTAGCT GGGACTACAG GCATGCACCA CCACACCCGG CTAATTTTTG 
8351 TATTTTTTTA GTAGAGATGA GGTTTCCACC ATGTTGGCCA AGATGGTCTC 
84 01 CATCTCTTGA CCTCATGATC CATCTGCCAT GGCCTCCCAA AGTGCTGAGA 
8451 TTACAAGTGT GAGCCACCAC GCCCAGCTGG TTTTTCCAGT TTTTGCTGTT 
8501 TGGACGGGGT GGCTGAGTAT GTTCTTCCAG GTCATTGTCC TGTGCTGCCT 
8551 TGCCTCCCTG AGCCTCTGTT TCTCCTGTTA AATGTTGATG ATTCCCTGCA 

8 601 TCCAGGCCTG GTTTAGAGGT GTGGTGCTTT TGGCAGTGAG TATTGCCTTG 
8651 AATTCATGGC AATGAATTCA ATCCCCAGGG GCTGAGAGAG CCAGTCGTGG 
87 01 GGGACAGTAA GGGAGGTTTT TACTCTTTCA CCTGTCCCTG ACCCTGACTC 
8751 CTCCTCACCC CCTCCTACAT TTCCAGGGCT GAGGTAGGGA GGATAGTTGT 
8801 GGGGGTATGA CTCCTCTGTC CTTTGTCCCC AGCTCTGGGA GGGTCTCACG 
8851 GAACTAGTGA CGGCGACAGG CAACTATGGC AACTACCGGC GTCGGCTGGC 
8901 AGCCTGTGTG GGCTTCCGCT TCCCGATCCT GGGTGTGCAC CTCAAGGACC 
8951 TGGTGGCCCT GCAGCTGGCA CTGCCTGACT GGCTGGACCC AGCCCGGACC 
9001 CGGCTCAACG GGGCCAAGAT GAAGCAGCTC TTTAGCATCC TGGAGGAGCT 
9051 GGCCATGGTG ACCAGCCTGC GGCCACCAGT ACAGGCCAAC CCCGACCTGC 
9101 TGAGCCTGCT CACGGTGAGG AGCAGGGGGC AGGGAGGTGG GGAGCTGGGC 
9151 ACCAGGGGTT GACAGTTTCC CCAGGTCCTG GCTGTGGGCG TGGCCTGGGG 
9201 CTCTGGGTTC TGGCCAAGAA ACTGAGATCT AGCGTGGGCT CTGGGGTTTG 
9251 GAGTGGATGC TGAGAAGGGG TCCAGGCTCT GGTTGGGGCT GTGGACTGAG 
9301 GTCTGATCTC CAGGCTGGTA TGTGGACTGT GGGCAGTTTG AACTGGGCCT 
9351 GGGTCCCGGG TTGAGTTCTG GCAATGGGCT GTGTTCTAGG GCTGGGCCAA 
9401 GCTCTGCATT CTGTGGGCAG GGGTGGTTTC TAAGCATGGC CCTGGGCTCG 
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GAGTGAAGTT 
GAGTTGGGTT 
GCTTAAGTGT 
AGGGGATCAG 
CTTCCCTGAC 
GTGGTTTGAT 
TAAGCCAGGC 
CTGTGGCCTC 
GGGCCAGTGG 
TATCAGACGG 
CTCCAAGTCC 
CTCCTGAGAA 
TCACCCATCC 
GTTGGGCAGA 
TCTGGGCTCC 
GCCAACCAGC 
AGGAGTGGAC 
GAGCACATCG 
GTGTGGGGAG 
GTGTCCTGTT 
CAGGCCAGCT 
GTATGGAACG 
ATGGACTGGG 
CCCTCTTCTG 
AACTTTGACG 
CATCCGTGGG 
ACCAGTGAGG 
ACTTCTGCCT 
AGATGATCTT 
TGTGGGGAGG 
TTCAGACAGA 
GGACTGTGGC 
AGAAGTGTGG 
AAGAAAGGTT 
GCTGACAATT 
CCCTGAAGCC 
TCAAAATTTA 
ATTTCAAAAA 
AAATAAAATC 
GTAAGAAGTG 
TTTGGGAGAC 
GCCTGGGCAA 
TTTTGAGACA 
ATCTCGGCTC 
TTCAGCTTCC 
TAATTTTTGT 
GGTCTCAAAC 
TGCTAGGATT 
TTGGGCATGG 
GCAGGAGGAA 
CACACCACTG 
AAAAAAAAAA 
TTTTCCACTG 
CTTGGCAGTA 
TTGATAGAGA 
AGAGCATAAA 
CTTCTCCCAG 
GGTTGGTGGG 
TTTGAGACGA 
TCCACCTCCT 
GTGTGTGTGT 
TCAAAAACCT 
CAGGACCAGG 



CTGGGCTTGG 
CTGGTTTAGA 
TAAGGTTTGG 
GGTTGAAATC 
ATCTTGGCAA 
CTGTGCGCTG 
TTTGTCCTGA 
TAGGTCGACC 
GGGTGATCAG 
AGGATGAGCT 
TCGGTGAGGG 
TCCCAGGATG 
ATCTTGTAGG 
CTTCCCTCTC 
CCCTGCCTCT 
CCCACGAGTT 
CTCGGCTGCC 
AGAAGATGGT 
AGGCCAGTTT 
CAAGACCCAG 
GCACGGGGCT 
GGGGTGGTGT 
AACGTGGGAA 
GCTCTGCGTG 
TCGATGGGGA 
AACTTCCCTT 
AGGGCTGGGG 
GGGCTTCAGT 
GGCCTGGGAA 
AGAGGCTGCC 
TGGGGGGCTC 
TACAAAAGTG 
GCAACTTGAG 
TGTAATTAGC 
TGGAAAAACA 
AGATTCATGC 
AAGTCCATCT 
TCTGTTGTTT 
TAGGTGGTAA 
AGAGGCCAGG 
TGAGTTGGCA 
CAGAGTAAGA 
GAGTCTCACT 
GCTGCAACCT 
TGAGTAGCTG 
ATTTTTAGCA 
TCTTGACCTC 
ACAGGCATGA 
GTGGCACGTG 
CCCTTGAGCC 
CACTTCAGCC 
AAAAGAAAAA 
GTAGAAGTTG 
TGTGGATTCT 
TTTTGTTTGT 
TCTAGTTCTG 
ATTTTAGGCA 
TGGATTTTTA 
TGGAGGCCTC 
GCAGGCCCCA 
GTGTGTGTGT 
AGCACCTGGT 
CCCAGCTCAC 



CTTTACACTT 
TCCAGACAAG 
AGTGGATTCT 
AGAGCTTCTG 
TATGTTGTGT 
GGATGACATG 
GTCTAGCTTC 
TTTGGCCCTG 
ATCTGTGTGT 
GTACCAGCTG 
GGTACTCCCT 
TGAGGATGGG 
ACAGAGGCAT 
CCAGGGATTC 
GGCCCTAGCT 
GCACCCCACC 
AAACCCAAGC 
GGAGGTGAGC 
GCCGGAGCAC 
CACTCAGCCC 
GAAGTGCCCC 
CAGAGACCTC 
GGGACAGGAC 
TCCCTCTGCG 
TGGCCACATC 
ACCTCAGCGC 
ACCTGGGGGA 
TTCTTGTGTG 
GCTGCCAGTG 
AGCTGTGGAG 
CACCTGAGTC 
CTGTTTTATT 
CTCCTGGATA 
AGTACCCAGT 
GCTGGTTCTC 
CCTATTTTTG 
CAGGTCGATT 
TTTATTTCCA 
AATAAATTTA 
TGCGGTGCCT 
GGATCAATTC 
CCCTGTCTCT 
CTGTTGCCCA 
CCACTTCCTG 
GGATTACAGG 
GAGATGGGGT 
AAGTGATCTA 
GCTACTGCTC 
CCTGTAGTCC 
CAGGAGGTTG 
TGGGTGACTG 
AAAAGGAAGA 
CCATGATTAA 
TCCAGTCTTC 
TTGGTGTCTG 
CTTGTGGCTT 
GAGGTAGTTG 
TCTAGACCAC 
AGCTTCATGC 
AGCTGTGTGT 
GTTGGTAAGG 
TCGGCAGGAG 
CACTTCATTG 



GGTCTTGGGG 
GTTCTAGACA 
TAGCTGCTTC 
GCTGGGTTCC 
TCAAGGTTTG 
GGGGTTGCTG 
TGACCCGAGC 
GGCTCTGTGG 
CCCAGGTGTC 
TCCCTGCAGC 
CCTCTCCACT 
AAGAGCTCTT 
CCTGGGGGTA 
CCCTCTCTGT 
CAGGCCCGAC 
ACCCCGGCCC 
TGGATCAGGC 
TCCTGCGGAG 
TGCCCTGGAA 
CTAGGAGTCA 
TGGGTAGGGT 
TCTGAGACAC 
CTGATGTCCC 
TGCCCCAGTC 
TCACAGGAAG 
CTTTGGGGAC 
GAGGGAAGGC 
CAAGATGAGG 
TGGGAAAGGG 
GCGCAGTGGT 
TTGCAAAGAC 
TGTGGAGCTC 
GTCTGTTCTA 
TGTTTATCAA 
TGAAGTAGGT 
CTGAGCAGAA 
TATTTTTTAA 
CATTACAAAA 
TAGTGAACAA 
CACGCCTGTA 
AGGCCAGGAG 
AC AAAAAT T A 
GGCTGGAGTG 
GGTTCAAGTG 
CATGCATCAC 
TTTACCATGT 
CCTGCCTTGG 
CTAGCCTAAA 
CAGCTACTCA 
AGACTGCAGT 
CGCGAGATCA 
AATGAAAGTC 
GCACTGTTAA 
TTTTCCCAGG 
TTTCATGGAC 
TTATCATAGC 
AGTTCCATGT 
CTTTTCAGTG 
AGCGGGCTCA 
GTGTGTGTGT 
GGAAAGCCCC 
GGAGACCAGC 
TAAAGCTCCC 



TCTAGGGTGG 
TTGGGCTGGG 
TGGGCTCTGG 
GACCTGGCTT 
GGGCCATGCT 
TGCTGTGTTC 
TCTGGCTGAG 
CCGTGGGCAG 
TCTGGATCAG 
GGGAGCCGCG 
CTGCCCTTCC 
AGCAGCCACC 
GGGCAGTAGT 
TCCCCGGGGC 
CATTTCCATA 
CCGGTACTGG 
CCTCGTGGTG 
CCTGAGCAGT 
GCCAGCACGA 
CAGGGCCTGG 
GGGGGTGGAG 
ACCTCATCAA 
CTTTACTCTC 
TGTGTTCCGG 
AATTCCAGAT 
CTCGACCAGA 
AACTCAGCCC 
TCACTGAGCC 
CACTTGCTTT 
ATCTCACAAA 
TGTGACCTGG 
ACAGCTGTCA 
ATGAATAGAT 
CAGTTCATAT 
TAAACATGCC 
AAAACTCCAT 
TGTTACCTGT 
ATCCACGGTA 
AATGTTTAAA 
ATCCTAGCAC 
TTTGAGCCCA 
T TAT TAT TAT 
CAGTGGTACA 
ATTCTCCTGC 
CGTGCCTGGC 
TGGCCAGGCT 
CCCCCCAAAG 
AAAATTTTTT 
GGAGGCTGAG 
GAGCTGTCAT 
CCCCCATCAA 
CCCTCTTTCC 
CAATATTAAG 
CAGGTGCACA 
AAACAGGATT 
TGCTTTATTT 
TTTCTCCCTG 
AGAATGACCC 
GCCTTAACCC 
GTGTGTGTGT 
TGGTTGGGTA 
ACCGGCTCCC 
TCTTTGTTTC 
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12601 TGGAACTTGG GTGTTTCCAT 
12651 CAGCAATTGT TGATATATCT 
127 01 TCTCTTTTTT CTTTTTTCTT 
12751 CAGAGTCTCA CTCTGTCGCT 
12801 TCACTGCAAC CTCTGCCTCC 
12851 CCCAAGTAGC TGAGATTACA 
12901 TTTTCTTTTT CTTTTTTTTG 
12951 TGGAGTACAG TGGTGTGATC 
13001 TCAAGTGATT CTCCTGTCTC 
13051 GCGCCACCAT GCCTGGCTAA 
13101 CTCACTCTGT CACCCAGACT 
13151 CAAGCTCTGC TTCCCGGGTT 
13201 TAGCTGGGAC TACAAGCACC 
13251 TTTAGTAGAG ACGGGGTTTC 
13301 CCACTAGCCT CAGCCTCCCA 
13351 CACCCAGCCT AATTTTGTAT 
13401 CGCAGGCTGG TATTGAACTT 
13451 TCCCGAAGTT CTGGGATTAT 
13501 TTGTATTTTT AGTAGAGATG 
13551 GAACTCCTGA CCTCACCTCA 
13601 GCTGGGATTA CAGGCATGAG 
13651 CTTGACAGAA CTTCACGCCT 
13701 GTTTTCCACT TAACCCTTGA 
13751 AGAACTACCT CATTCGTTAG 
13801 AGTCCATCAT TTCCCTAACC 
13851 CCAGTTTTTC AGTATGATTC 
13901 TGATCCACTC AGGCCAGTAT 
13951 AATTGGATCA AAAGATATGC 
14001 CTGACCTCAG ACAAGGTTGT 
14051 AGTGCCTGCT TCCCAACTTC 
14101 ACAATTTTGC CAGTCTCATT 
14151 TCTTTAATAC TAAGTGGGGG 
14201 TTTATTTCTT CTGTCAATTG 
14251 TGGGTTTGTT GGTCTTTTTC 
14301 TATTAACCCT TTGCTGTTGA 
14351 TCATTTATGT GTCTTTTTCC 
14 401 ATAGGTCAGT CTTTCCCTTC 
14 451 AAGGCCCTCA GCCCCTCAAG 
14501 TTTTTTTTTT CTGAGACAGG 
14551 CAGTGGCATG ATCTTGGCTC 
14 601 ATTCTCGTGC CTTAGCCTCC 
14 651 TATGCCTGGC TAATTTTTTG 
147 01 TTGGCCAGGC TGGTCTTGAA 
14751 CTCCCAAAGT GCTGGGACTA 
14801 TTTTTTTCAG ATAGCCAGTT 
14851 CCTGGATCAC ACATTATGAG 
14 901 GAGGGCCAGT CCCGATGGGA 
14 951 CTGTGCAGAC ACTGTTGTAA 
15001 CACCAAAATC CTATGAAATG 
15051 AAGGAAACGG AGGGACAGGG 
15101 GAAGTAGAGC TGCGATTTGA 
15151 TTCTTTCCCT AAAATGTATT 
15201 CCTGGGTTAT GTGGGAAACC 
15251 GATGCAGGAG AGAGAGGGAT 
15301 AACTCATTAG AGAGTTCTCC 
15351 CAGTACACTT CGGGAGCCCG 
15401 CATTTTAGTT TGCACACCAA 
15451 TGAGAAGGAA CCCAGGGAAA 
15501 CGGCACTTCC CACATCTGCC 
15551 ACTCTGCAAC AGGAGTCCAA 
15601 AGTGGGATGG GTTAGGCCAG 
15651 GAAGCGCCCC CGCAACTCCC 
15701 ATGGTTTCCT ATTTCCTGCG 




TTCTTTCTTA CAAAATTATC TATGCATTTA 
TTAGGCAGCA TCTAGGTACT TGTAGTGGGT 
TTTTTTAATC ACCCTCTCTT TTTTTTGAGA 
CAGGCTGGAG TGCAATAGCG CGATCTTGGC 
CAGGTTCAAG TAATTCTCAT GCCTCAGCCT 
GGCACTGGCC ACCAGACCCG GCTAATTTTT 
AGACGGAGTT TCGCTCTTTG TTGCCCAGGC 
TCGGCTCACT GCAACCTCCG CCTCCCGGGT 
AGCCTCCCGA GTAGCTGGGA TTACAGGCGC 
TTTTGTATTT TTTTTTTTTT GAGACAGAGT 
GGAGTGCGGT GGCGCGATCT CGGCTCACTG 
CATGCCATTC TCCTGCCTCA GCCTCCGGAG 
CACCACCGTG CCCGGCTAAT TTTTTGTATT 
ACCGTGGTCT CGACCTCCAG ACCTCGTGAT 
AAGTGCTGGG ATTACAGGCG TGAGCCACCT 
TTTTAGTAGA GATGGGGTTT CACCATGTTG 
CTGACCTCAG GTGATCCGCC CGCCTCGGCC 
AGGCGTGAGC CACCGCACCT GGCCTAATTT 
GAGTTTTACC TTGTTGGCCA GGCTGGTCTT 
GGTGATCTGC CCACCTCGGC CTCCCAAAGT 
CCACTGTGCA CCCGGCCTAA AAATCACCAT 
TGCTTTTTGT TTTTTTTCAT CTTTGTGCTT 
TC AC AG AC AT CTTTCCATGT GGATTCATGT 
AACAGCTGCA GAGTATTCCA CTGTGCGGTT 
ATCCTCCTGC TGATGGACAG TTAGACTGTT 
TATGCCAGGC TGCCATGAAC GTCCTTTTAC 
TTCTGTAGGA GAAATTCCTA GAAGTGGGAT 
ACATTCTAAA TTAGGAGAGA GACTGCCAAA 
ACCAGTTTGC ACCCCCATCA GCAGCGTACA 
CTCGCCAACA GGGATGCTAT AAAAAGCTTC 
GGCAAATGGT ATCTTGGTTA AATTTGCATT 
TAGGGTATCT TTTCATATGT TTATTGGCCA 
CCTGTTCTGA TTCCTTGTCC ATTATTCTAC 
TCATTGATTT TTAGAATCTC TGTTAATGGA 
ATGTGTTTGC AAATATTTTC TCCCTGTCTG 
ATATAAATTT AAAAAATTTT GGTGGGCTCA 
CGGGCTTCTG GGATTTGTGT TCGGGGTAGA 
ATTATAAAAT TATAAAACCT TTTCTTTTTT 
GTGTCTTGCC ATGTCACCCA GGCTGGAGTG 
GCTGCAACCT CCACCTCCCA GGTTCAAGTG 
CGAGTAGCTG GGATTATAGG TGCCTGCCAC 
TATTTTTAGT AGAGACGGGG CTTTGCCATG 
CTCCTGACCT CGTGATCCAC CCGCCTTGGC 
CAGGCGTAAG CCACTGTGCT CGGCCCTATA 
ATCCTAATGC TCCCTTGATT TGATGGACCA 
CCCCCTCATA AGCAGGTGGG AGTCTCAAGC 
ATAGCACTTG GTGGCTGAGG ACCCTCCTAT 
AACTTCACAT GCATCATCTA ATTTAGTCCT 
TAGGAATGAT CATTACACCC ATTTATAGAT 
AGATTACTCC GCTACAGGTC AAGAGGCAGG 
ACTGAGGTCT GTGTCTAGAA CACGTGCTCA 
CATAGGTGAA AAAGGGCTTC TGCGGAAAGC 
CTGGATTTAC AGCTGTCTTT CCAGCAGGAT 
GCGATTTCTC CCAATCTCTC CTGGTCCCAG 
CTGCTGAGGG CTCCCGACTG GTGTTGCACA 
AGGCTGATGG TTCCATGGAA AGTACACAGT 
GTGTGAAGTG GGCAGGACAG GCCACTGTTC 
GGGACTGGCC CAAGACCACA CACTGGTTAG 
TGACCCCTAG TCCAGTGCCG CCTTTTCTTT 
AATCAGGAGT TCCATGAGGA CACTGGGAAC 
CGGTGGATGG TTCTGGGGAG GGCCCGAGCT 
CACAGGGATG GCTGCATCAG CAGGGAGGAG 
CTCCAGCTCT GTGTTGGGGG GGCGCATGGG 
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CTTCGTACAC 
GCCACTGCAA 
CACGCCCCTC 
AGAGCCCTTC 
CTGTGGTTCT 
CCTCAAGAAG 
ACGGCCCCTC 
ATGCCGAGGT 
TTGTTTGTTT 
AAGGAGTGAA 
CAAGCCTGTA 
AGGTCAGGAG 
ACTAAAAATA 
AGCTACTTGG 
GTTTGCAGTG 
GAGGGCTCTG 
AACCCCACAA 
TTAAGAGGCC 
GGGAGAAGGC 
GACTCTTATT 
GGGGCGCAAG 
GGCTGGCTCT 
GCAGTGCAAG 
TGAGCCTGGA 
CACCGCGCCT 
GCCTCCAGGT 
GCAGGGCTAC 
TAATTCTCTT 
TGGCTCACGC 
ACGAGGTCAG 
TTTACTAAAA 
CTGTAGTCCC 
GGGAGGCGGG 
GGGCGATAGA 
GAAGGGACAG 
TGAGGCTGAG 
AGGCTTCAAG 
GGATTGGAGA 
CTGCTCTGGA 
GCAGCTCTCT 
ACTGGCCAAA 
CTGGGGTCTA 
AAAAAACAGA 
AATCCATTTG 
TACACACAAA 
AAAAAGCTGT 
CTGAGGCCAG 
CCAAGGTGGG 
ACGTGGCAAA 
CGTGGTGGCA 
GAATCGCTTG 
ATTGCACTCC 
AAAAAAAAAA 
GGGTGGGTCT 
GGGTTGATGA 
TCCCAGCAGA 
TTTGACATCC 
AGCTCCCCAC 
TGCTTTATCC 
GGGGGTGTCT 
CAAGGACTCA 
CTGGGGGTGT 
GCATGCAGCT 



AACTTCCAGG 
AGCCCTGGTG 
CAGCCCCGGC 
TCAAGCCAGG 
GCCCGGGGCC 
GAAGAAGTAG 
CTCTATTGCA 
GAGATGGAAT 
GTTTGTTTGG 
GTACCTTAAA 
ATCCCAGCAC 
TTCAAGACCA 
CAAAAAGTAG 
GAGGCTGAGG 
AGCCGAGATC 
TCTCAAAAAA 
AATCAGAGGC 
ATTTATTTTG 
ATAGACTGAC 
TTGGTGGGGC 
CCTGCGGGAT 
CCATTTGCTC 
GATCGCCTGT 
GGGGTCTGCA 
TCAGCTTCTC 
AAGAGGGAGT 
TGGGGCAAAG 
TTTGTGGTGG 
CTGTAATCCC 
GAGATCGAGA 
ATACAAAAAA 
AGCTACTTGG 
GCTTGCAGTG 
GCAAGACTCC 
GTAAGGGTGC 
G AG AG AG AC C 
ATTCCAGTTA 
AGGTGTTATA 
GGCTCTTCTG 
TGGGGTATTT 
TGATTATTTT 
AGGTAGTTGA 
GGCAGGCACA 
CATTGAGCCT 
ATTGACATTT 
GGCAAAACAA 
GCATGGTGGC 
TGGATCACCC 
ACATCATCTC 
GGCGCCTGTA 
AACCTGGGAG 
AGCCTGGGCG 
AAATTGGGCT 
GCTTCTGGGA 
AGTCCCAGGG 
GATCCGTGAG 
ACTTGTAATA 
TGAGGGCTGG 
AAATGGCTCC 
TCCTCACAAC 
TTCCTGCCTT 
CGGGGCAGGA 
GAGGGCAGGG 



AGAGCAACTC 
AGAGTCCCTT 
CCCGCCCTCC 
AAAACCTGGT 
CTGAGGCGGG 
AGTCATCACC 
GATCCTGGGC 
GACTGGAAGG 
GAGAGTTACT 
ATCAGAGGCG 
TTTGGGAGGC 
GCCTGACCAA 
CTGGGCGTGG 
CATGAGAATC 
ACGCCACTGC 
AAAAAACAAC 
TCAAGATGAC 
ATGACGCAGC 
AATTAGGAGG 
AGCTGCTCAG 
GGGATGGAGG 
TCCCCCAGCC 
CAGTTGAGTG 
CCCTCACCCT 
TCTGCCCCGC 
CATTCTGTAC 
AACGCAGGAT 
GATAATAAAG 
AGCACTTTGG 
CCATCCTGGC 
AAAAAATTAG 
GAGGCTGAGG 
AGCCGAGATC 
GTCTCAGAAA 
CAGAAAGTGG 
CTAATTTATA 
CAGTCTTATT 
TGAGCCATTG 
GGGAAAGTCC 
GATGGTTTTA 
GCTGAGAACA 
TCACAACTGT 
GGGCATACAT 
TCCAGAAGTG 
AAGCAAACTG 
AAACATAGAA 
TCACGCCTGT 
GAGGTCAGGA 
TACAATACAA 
ATCCCAGCTA 
GTGGAGGTTG 
ACAGAGAGAG 
GTGAGGTCAT 
TGATGTGGAT 
ACCTGGAAGT 
GAGGAGGTAC 
GATGGTGAGT 
GGGGGAGCTG 
AAGCCAGGTG 
CTGTTTTTCT 
GGAGAAAATA 
GGCTGGGGAT 
CCAGGGCTGG 



CTTGCGCCCC 
TCCCGGCTCA 
CTTCTGGCCC 
AATTCTATTT 
CTCTAAAGCC 
TCTAAATCCC 
ATCTACAAGC 
GCTGCTGGGC 
ATTTTGGTGG 
CATGGCCGGG 
CGAGGCGCGC 
CATAGCGCAA 
TGGCACCCAC 
GCTTGAACCT 
ACTCCAGCCT 
AAAAAAACCC 
TGATGTGAAG 
TGCCCAGGAA 
AGGAGAACAC 
GAACAAAGGT 
GTATTCTGAC 
TGTGGAGTGA 
TCGGCGCAGG 
CACCCATGCA 
CCTGGCAGGC 
TGGCCTGTGG 
GGAAGCCATT 
AAGGGACAGG 
GAGGCCGAGG 
TAACACGGTG 
CCAGGCGTGG 
CAGGAGAATG 
GCGCCACTGC 
AAAAAAAAAT 
CCAGGAAGCC 
AAGAGGTATA 
TTGTTGGAGG 
GCTTGCCTTT 
CTTGCCCTGA 
GGTCAGTTTG 
GTCCGAACAA 
TTGGGTTGGC 
CCTCAAAAAT 
CTGGGGTCTA 
CGCTGACAAA 
AAAGAGCCTC 
AATCCTAGCA 
GTTGGAGACC 
AAATACAAAA 
CTTGGGAGGC 
CAGAGAGCCG 
ACTCTGTCTC 
GCAGGGAATT 
GCCTCCCGTG 
GTGTTCTGCA 
AGACGGTGGA 
CCTCCCACAG 
GGGAGTATCA 
GGCTACTACC 
CTTCCCAGCT 
CTTCAACCAG 
GGGGGTGGGA 
TGTCCCTAAG 



GTCGCCTGCC 
CGGCCCAAGC 
CGCCTCTGCC 
GCCTCTCCTC 
CTAGTCTCAC 
TCCTCCCACC 
AGGGCCTCAA 
AGTGTTTTTT 
GGCAATTGCC 
CATGGTGGCT 
AGATCACCTG 
CCCCGCCTCT 
CTGTAATCCC 
GGGAGGCGGG 
GGGCAACAGA 
CCAAAACCAA 
GGAGTGGCGT 
CAGAGAACAT 
TTTGGAAGGA 
TCCTGGTAGG 
CAATGTCCCT 
ACTGCCACAA 
GCCCAGAGTG 
CAGCCACCAT 
GAGGCTCCAG 
AGGGAAGGAT 
CCAAAGTGCA 
CCGGGCGCGG 
CGGGCGGATC 
AAACCCCATC 
TGGCGGACGC 
GCATGAACCC 
ACTCCAGCCT 
AAAAAATAAA 
CTGGACCTTC 
AAAGTGAAAG 
GGTTAACAAA 
CCCTTTCTGG 
TAATGTCCTG 
CTGAATGACA 
CTATGTTAAA 
ATAAGTCCTC 
AGAAAAGATA 
AAATGTGAAA 
TCTGTGGCTG 
AAAAATTGGG 
CTTTGGGAAG 
AGACTGGCCA 
ATTAGCTGGG 
TGAGGCACGA 
AGATTGCGCC 
AAAAAAAAAA 
GATTTTTGGT 
GAGAGGGGAA 
GCAATCCCCC 
GGATGGGGTG 
CTGGCACCAG 
GGGAAATGGG 
TTGTTGTTAG 
GTGGTTGGAT 
AGCAGGGAGC 
TATGAGGGTG 
GTTGTACAGA 
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18901 CTCTTGTGAA TATTTGTATT TTCCAGATGG AATAAAAAGG CCCGTGTAAT 
18951 TAACCTTCAC CATCAGCGCC TAGAATCCCG GGGGGTAGGG GGATGGTATA 
19001 CTTTACAGGA TGACAATCTT GGGAGCTAGA ACTTTGTAGC CAGAGAAACT 
19051 TGGGAGGTCT GGAATCTCAT GTGTCTGGAG TCTTGGGGAA GAGAATCTTA 
19101 GAAGCAGAAA ACCTTGGAAC ATAAGAATCT TGGGGAGGGT CTAGGATCTT 
19151 GAGGAGACCA GATCCTTGGA CATCTAAAAC TTGAAACTAG TAGGTCTGCA 
19201 CCCGAGAATT GCAGGGCCAG TCATGCATAC CCAAAGCCTT CAGCCCATGG 
19251 CCGAAATTCC CTTGCTGGAC AGGGGGCCTT TCAGCCCCTG CTTGGACGCT 
19301 TCCAGTAACA GGGCCCTCAC TGCAGGAATC GTGGGAGGGA GAGGGGCAGC 
19351 ACAGAGTTGC TGGCTGTCGG GGAAGGGAGG GAGGGCCCTG GGCAGTCCGA 
194 01 GGGCCCTGCT GGGCTTGTGC CTCAGGGTGG GGGCTGCACT CCTCCGCCTT 
19451 GCAGCCTCCT GGCCTGGTGC TGCTGCCAGC CGGAAGGACA GTGACTTCCA 
19501 GAGGAAATGC ATATTGATCC TGCTTTCAGC CTCCGGTGGT GGCTTCTCCC 
19551 AACCCAGCTC TTCCCTCCTG AGCCTGCAGC ACGGAGGTTT TGGGGGTCAC 
19601 TGCTACCTAA AGAAGGCTAA GGCCACTTCT GAGGCTGGTC TGGGAGTTTA 
19651 CTAAAGGTTC TGAAGCTGGG CCGGGCTGCC CCTGGGATCA GGAGACTCCA 
19701 GACAGCAGTC CTGACAATGG GAACTACCTC CTCAGTCCCC CAAACTGGGA 
19751 GGTGTCCCAC AGCAGCTGTA GGATTGTCCT AGGGGTGGAG ACCTGAGCAC 
19801 CTTCCACTCC AAAGCACAGT ATCTGTGGGC CTGGCAGTGG CCTCAGTTCC 
19851 CCCATGAGTG CCCCGGTCCC CCACCCCAGG GTTTCCCCAC ATCACATCCA 
19901 TCCCTGCTTT GAGACCCCAC TCCCCCTGGC CTGTTCTTTA TTTTGGGTCA 
19951 CTCCCTTCTC TTTCCTGGTC ATATCTCTCC TGCAGGCCTA CCCTGTGTTG 
20001 GGCCCCCCAG CCCTGTCTCT GCATCGGGTG CCCCCCTGCC CCTCCTTCTG 
20051 TCCTCAGCCC CCTCCGCCCT TCCCCCTCTT GAGGCTGTAA TATCCGTTTC 
20101 ACGATTTGGG GGCTGAGTTG CTATAACAAC AGACGGCGAT TGTGTTGTGA 
20151 AGAGCAGCTC GCTCCTGTGC CGCCTGCCTC CTGTGCTGCC TCCATCCCTG 
20201 CAGCCCAGTC GGTTCCTCTT GGCTCCTCTC GTCACTACCC TCCAGTTCCA 
20251 GTCTGGCCTC TTCCTGGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTAT 
20301 GCATGCATGC ATATGTGTGT CCAGGTCTGC CTGCCCGGGA TGTGACAAGT 
20351 AGCGGTCTTC ATGGTTGCAT GTGTCTGAAT TTGGTGTCTG AGCTTCACAT 
20401 TGTATGCGCC TGTGTGCATG TGTGTGCATG GACATGCATG CTGTATCTGC 
204 51 TGTGTTTCCC CTCCCCCATG TGTCCCCACT GGCCTTTGCA CATGGGAGAA 
20501 GGGCATGTGC TCAGCATATC ACTCAACTGT CCACATTGGG TGGGTACCTG 
20551 TGTGTGGTGT GTGTGTGTGG GGGGTGTGTC TTGAAGTGGC AGGTCCCAAA 
20601 TGCTTAGGCA ATCTGAACCT TGGACCTTGC AGAGAGGAGA GATGTCCCTG 
20651 TAGGTGGGAG GGACAGGGAG ATGCAGCAGC TGCCCGGTGA CCTTTTCTGC 
20701 CCTTGATGGG CAAAGCTGGG GGTAGGGAAA GGAGACAAGT GCTCATACTT 
207 51 ACCTCCCTCC CTGCCCAGGC TCCTCTGTAA GGGTCTGAGT CTGTCTCTGT 
20801 GAGCCATTGC ATCTGTCTGT CTATGCCCTG ATGCCTGGAT GGACAAGGGG 
20851 TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG AGTGTGAGGC TGCAGGAAGA 
20901 GGAACAGTGG GGGATGGGCA GGAAAGTGGG CTGTGGGGTC AGGGAGGCGA 
20951 T (SEQ ID NO:3) 



FEATURES 






Start: 


3000 




Exon: 


3000- 


-3072 


Intron : 


3073- 


-3753 


Exon: 


3754- 


-3855 


Intron : 


3856- 


-4363 


Exon: 


4364 


-4427 


Intron : 


4428 


-4786 


Exon: 


4787 


-4918 


Intron: 


4919 


-5702 


Exon: 


5703 


-5853 


Intron: 


5854 


-6056 


Exon: 


6057 


-6230 


Intron : 


6231 


-6389 


Exon: 


6390 


-6506 


Intron : 


6507 


-8832 


Exon: 


8833 


-9114 


Intron: 


9115 


-9885 


Exon : 


9886 


-9963 
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Intron: 


9964- 


10201 


Exon : 


10202 


-10324 


Intron: 


10325 


-10638 


Exon: 


10639 


-10754 


Intron: 


10755 


-15675 


Exon: 


15676 


-15817 


Intron: 


15818 


-16071 


Exon: 


16072 


-16108 


Intron : 


16109 


-16828 


Exon: 


16829 


-17008 


Intron: 


17009 


-18491 


Exon: 


18492 


-18565 


Stop: 


18566 





CHROMOSOME MAP POSITION: 

Chromosome 11 



ALLELIC 


VARIANTS 


(SNPs) : 




DNA 








Position Major 


Minor 


Domain 


5539 


C 


G 


Intron 


5658 


T 


G 


Intron 


5861 


C 


T 


Intron 


6023 


A 


G 


Intron 


6799 


C 


T 


Intron 


9579 


C 


A 


Intron 


9842 


T 


C 


Intron 


10159 


T 


C 


Intron 


12025 


A 


- G 


Intron 


14723 


T 


C 


Intron 


14996 


G 


A 


Intron 


16153 


T 


G 


Intron 


16181 


G 


A 


Intron 


16756 


A 


G 


Intron 


18059 


A 


G 


Intron 


18364 


A 


- T 


Intron 


18861 


G 


A 


Beyond ORF(3' ) 


20443 


G 


A 


Beyond ORF(3 f ) 


20881 


A 


T 


Beyond ORF(3 f ) 



Context : 



DNA 

Position 

5539 AGACTCAGTTCAGGCATGAAGTCTCCGTGGGCTCTGAGGGTTCGGGGCTCTTCCGGGGTA 
GAATTTGTCGTTCCCACCTCTGTTTTCCATGGCACTTTGTACAGACTCCTGTACAAAGAC 
CTCTGTACATGTGTCACGCTGTTTTGTGATCATGTGTTTCTGTGTCTGTCTCCCTCAGTA 
GACTGTGAGCTCCTCGAGGGCAGGAACCGTGTCTTACTCATCTCTGTATTCCCAGCGCCT 
AGCACAGTGCCTGGCACAGAGTACGTTGTTCATAAATGTGTGTTGAGTGCATGACGGGGT 
[C,G] 

GGGGGAGATGAGGAGGAGTTGCTGGGACTGGGAACATTCGTGCCTAGGACAGTGCCTCGC 
ATTATGTAGGTTCTCAGTAAGCGTGAATGGTGTGTCTGTGTGAGTGGGGGGCCACGAGGC 
ATGCGCATGTCCAGCAAAGGGCTCACTACCCCTGCCCCCCCAGCCCTACCTACAAGTGGA 
AGCGGCAGGTGACTCAGCGGAACCCTGTGGGACAGAAAAAGCGCAAGATGTCCCTGTTGT 
TTGACCACCTGGAGCCCATGGAGCTGGCGGAGCATCTCACCTACTTGGAGTATCGCTCCT 

5658 CCTCTGTACATGTGTCACGCTGTTTTGTGATCATGTGTTTCTGTGTCTGTCTCCCTCAGT 
AGACTGTGAGCTCCTCGAGGGCAGGAACCGTGTCTTACTCATCTCTGTATTCCCAGCGCC 
TAGCACAGTGCCTGGCACAGAGTACGTTGTTCATAAATGTGTGTTGAGTGCATGACGGGG 
TGGGGGGAGATGAGGAGGAGTTGCTGGGACTGGGAACATTCGTGCCTAGGACAGTGCCTC 
GCATTATGTAGGTTCTCAGTAAGCGTGAATGGTGTGTCTGTGTGAGTGGGGGGCCACGAG 
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[T,G] 

CATGCGCATGTCCAGCAAAGGGCTCACTACCCCTGCCCCCCCAGCCCTACCTACAAGTGG 
AAGCGGCAGGTGACTCAGCGGAACCCTGTGGGACAGAAAAAGCGCAAGATGTCCCTGTTG 
TTTGACCACCTGGAGCCCATGGAGCTGGCGGAGCATCTCACCTACTTGGAGTATCGCTCC 
TTCTGCAAGATCCTGGTGCGGCCCGAGGGCTGGGGGGTCAGGGGTCCAATGTGGGCTGGA 
AGAGAGTTCTAGGAGGGGCAGGGTCCCTGGCGTAGGCTGGGTCACAGGGTGCATCAGGGG 

5861 CTGGGACTGGGAACATTCGTGCCT AGGACAGTGCCTCGCATTATGTAGGTTCTCAGTAAG 

CGTGAATGGTGTGTCTGTGTGAGTGGGGGGCCACGAGGCATGCGCATGTCCAGCAAAGGG 
CTCACTACCCCTGCCCCCCCAGCCCTACCTACAAGTGGAAGCGGCAGGTGACTCAGCGGA 
ACCCTGTGGGACAGAAAAAGCGCAAGATGTCCCTGTTGTTTGACCACCTGGAGCCCATGG 
AGCTGGCGGAGCATCTCACCTACTTGGAGTATCGCTCCTTCTGCAAGATCCTGGTGCGGC 
[C,T] 

CGAGGGCTGGGGGGTCAGGGGTCCAATGTGGGCTGGAAGAGAGTTCTAGGAGGGGCAGGG 
TCCCTGGCGTAGGCTGGGTCACAGGGTGCATCAGGGGTTTCAGTGTAACCACTGAAGGTC 
AGCTGGAGGGTGAGGAGTGGCTATCAGTGAGGGGAGAGGCCGGCAAGGTGCTGAGGCCAC 
TCCTCATGCCCCCAGTTTCAGGACTATCACAGTTTCGTGACTCATGGCTGCACTGTGGAC 
AACCCCGTCCTGGAGCGGTTCATCTCCCTCTTCAACAGCGTCTCACAGTGGGTGCAGCTC 

6023 GGCAGGTGACTCAGCGGAACCCTGTGGGACAGAAAAAGCGCAAGATGTCCCTGTTGTTTG 
ACCACCTGGAGCCCATGGAGCTGGCGGAGCATCTCACCTACTTGGAGTATCGCTCCTTCT 
GCAAGATCCTGGTGCGGCCCGAGGGCTGGGGGGTCAGGGGTCCAATGTGGGCTGGAAGAG 
AGTTCTAGGAGGGGCAGGGTCCCTGGCGTAGGCTGGGTCACAGGGTGCATCAGGGGTTTC 
AGTGTAACCACTGAAGGTCAGCTGGAGGGTGAGGAGTGGCTATCAGTGAGGGGAGAGGCC 
[A, G] 

GCAAGGTGCTGAGGCCACTCCTCATGCCCCCAGTTTCAGGACTATCACAGTTTCGTGACT 
CATGGCTGCACTGTGGACAACCCCGTCCTGGAGCGGTTCATCTCCCTCTTCAACAGCGTC 
TCACAGTGGGTGCAGCTCATGATCCTCAGCAAACCCACAGCCCCGCAGCGGGCCCTGGTC 
ATCACACACTTTGTCCACGTGGCGGAGGTGCCTGCCCCTCCCTCCCGGTGTCTCCCAACC 
ACCCCACATGCCAGTCAGGCCAACCCTTCCCTTCCCCTAACCCACTGCCTTCTCTCTAGA 

67 99 CCATCAAGGTGCCTGGGACTGGGGAGGGGCCGGTGCTTCCCAGGTCTGTCTTCACTGGGT 
CCTCCCAGCAGCACTGGGGGCTGGGCACAGCTGTCCTCATTTGATAGATATGGAAATGGA 
GGCTCAGAGGGGTTAAGTGCTTTTCTCAGTTTGCACAATGGCAACAGCAGAGTGGGGGCT 
CACAGGTCGTCAGGGACCCCAAAGCTAGTACTTTTTTTTTTTTTTTTAAGACAGGGTCTC 
TCTCTCTGTTGTCCAGACTGGAGTTCAGTGGTGCAGTCACAAGCTCACTGCAGCCTTGAA 
[C,T] 

TCCTGAGCTCAATCGATCCTCCCACCTCAGCCTCCTGAGTAGCTGGGACTACAGGTGTAC 
GCCACCATGCCTAATTTTTGTATTGTTATTAATTTTTTTTTTTTTTTTTTAGAGATGGGG 
TTTTGCCATGTTGCCCAGACTGGTCTTGAACTCCTGGGCTCAAGTGATCCGCCTGCCTTG 
GCCTCCCAAAGTGCTGAGATTATGGCTTGAGCCATTGTGCCTTGCCACTTGTAGTTTCTT 
CTTTTCTTTCTCCTTCATTTTTTATTATTTTTGAAGTATTTTGAAGTATTGAGTAACATA 

957 9 CTGGTTGGGGCTGTGGACTGAGGTCTGATCTCCAGGCTGGTATGTGGACTGTGGGCAGTT 
TGAACTGGGCCTGGGTCCCGGGTTGAGTTCTGGCAATGGGCTGTGTTCTAGGGCTGGGCC 
AAGCTCTGCATTCTGTGGGCAGGGGTGGTTTCTAAGCATGGCCCTGGGCTCGGAGTGAAG 
TTCTGGGCTTGGCTTTACACTTGGTCTTGGGGTCTAGGGTGGGAGTTGGGTTCTGGTTTA 
GATCCAGACAAGGTTCTAGACATTGGGCTGGGGCTTAAGTGTTAAGGTTTGGAGTGGATT 
[C,A] 

TTAGCTGCTTCTGGGCTCTGGAGGGGATCAGGGTTGAAATCAGAGCTTCTGGCTGGGTTC 
CGACCTGGCTTCTTCCCTGACATCTTGGCAATATGTTGTGTTCAAGGTTTGGGGCCATGC 
TGTGGTTTGATCTGTGCGCTGGGATGACATGGGGGTTGCTGTGCTGTGTTCTAAGCCAGG 
CTTTGTCCTGAGTCTAGCTTCTGACCCGAGCTCTGGCTGAGCTGTGGCCTCTAGGTCGAC 
CTTTGGCCCTGGGCTCTGTGGCCGTGGGCAGGGGCCAGTGGGGGTGATCAGATCTGTGTG 

9842 TGGGCTGGGGCTTAAGTGTTAAGGTTTGGAGTGGATTCTTAGCTGCTTCTGGGCTCTGGA 
GGGGATCAGGGTTGAAATCAGAGCTTCTGGCTGGGTTCCGACCTGGCTTCTTCCCTGACA 
TCTTGGCAATATGTTGTGTTCAAGGTTTGGGGCCATGCTGTGGTTTGATCTGTGCGCTGG 
GATGACATGGGGGTTGCTGTGCTGTGTTCTAAGCCAGGCTTTGTCCTGAGTCTAGCTTCT 
GACCCGAGCTCTGGCTGAGCTGTGGCCTCTAGGTCGACCTTTGGCCCTGGGCTCTGTGGC 
[T,C] 

GTGGGCAGGGGCCAGTGGGGGTGATCAGATCTGTGTGTCCCAGGTGTCTCTGGATCAGTA 
TCAGACGGAGGATGAGCTGTACCAGCTGTCCCTGCAGCGGGAGCCGCGCTCCAAGTCCTC 
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GGTGAGGGGGTACTCCCTCCTCTCCACTCTGCCCTTCCCTCCTGAGAATCCCAGGATGTG 
AGGATGGGAAGAGCTCTTAGCAGCCACCTCACCCATCCATCTTGTAGGACAGAGGCATCC 
TGGGGGTAGGGCAGTAGTGTTGGGCAGACTTCCCTCTCCCAGGGATTCCCCTCTCTGTTC 

10159 GGGGGTGATCAGATCTGTGTGTCCCAGGTGTCTCTGGATCAGTATCAGACGGAGGATGAG 
CTGTACCAGCTGTCCCTGCAGCGGGAGCCGCGCTCCAAGTCCTCGGTGAGGGGGTACTCC 
CTCCTCTCCACTCTGCCCTTCCCTCCTGAGAATCCCAGGATGTGAGGATGGGAAGAGCTC 
TTAGCAGCCACCTCACCCATCCATCTTGTAGGACAGAGGCATCCTGGGGGTAGGGCAGTA 
GTGTTGGGCAGACTTCCCTCTCCCAGGGATTCCCCTCTCTGTTCCCCGGGGCTCTGGGCT 
[T,C] 

CCCCTGCCTCTGGCCCTAGCTCAGGCCCGACCATTTCCATAGCCAACCAGCCCCACGAGT 
TGCACCCCACCACCCCGGCCCCCGGTACTGGAGGAGTGGACCTCGGCTGCCAAACCCAAG 
CTGGATCAGGCCCTCGTGGTGGAGCACATCGAGAAGATGGTGGAGGTGAGCTCCTGCGGA 
GCCTGAGCAGTGTGTGGGGAGAGGCCAGTTTGCCGGAGCACTGCCCTGGAAGCCAGCACG 
AGTGTCCTGTTCAAGACCCAGCACTCAGCCCCTAGGAGTCACAGGGCCTGGCAGGCCAGC 

12025 TGGGGTTTTACCATGTTGGCCAGGCTGGTCTCAAACTCTTGACCTCAAGTGATCTACCTG 
CCTTGGCCCCCCAAAGTGCTAGGATTACAGGCATGAGCTACTGCTCCTAGCCTAAAAAAA 
TTTTTTTTGGGCATGGGTGGCACGTGCCTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAG 
GAGGAACCCTTGAGCCCAGGAGGTTGAGACTGCAGTGAGCTGTCATCACACCACTGCACT 
TCAGCCTGGGTGACTGCGCGAGATCACCCCCATCAAAAAAAAAAAAAAAAGAAAAAAAAA 
[A,-,G] 

GAAGAAATGAAAGTCCCCTCTTTCCTTTTCCACTGGTAGAAGTTGCCATGATTAAGCACT 
GTTAACAATATTAAGCTTGGCAGTATGTGGATTCTTCCAGTCTTCTTTTCCCAGGCAGGT 
GCACATTGATAGAGATTTTGTTTGTTTGGTGTCTGTTTCATGGACAAACAGGATTAGAGC 
ATAAATCTAGTTCTGCTTGTGGCTTTTATCATAGCTGCTTTATTTCTTCTCCCAGATTTT 
AGGCAGAGGTAGTTGAGTTCCATGTTTTCTCCCTGGGTTGGTGGGTGGATTTTTATCTAG 

14 723 GGCTTCTGGGATTTGTGTTCGGGGTAGAAAGGCCCTCAGCCCCTCAAGATT ATAAAATTA 

TAAAACCTTTTCTTTTTTTTTTTTTTTTCTGAGACAGGGTGTCTTGCCATGTCACCCAGG 
CTGGAGTGCAGTGGCATGATCTTGGCTCGCTGCAACCTCCACCTCCCAGGTTCAAGTGAT 
TCTCGTGCCTTAGCCTCCCGAGTAGCTGGGATTATAGGTGCCTGCCACTATGCCTGGCTA 
ATTTTTTGTATTTTTAGTAGAGACGGGGCTTTGCCATGTTGGCCAGGCTGGTCTTGAACT 
[T,C] 

CTGACCTCGTGATCCACCCGCCTTGGCCTCCCAAAGTGCTGGGACTACAGGCGTAAGCCA 
CTGTGCTCGGCCCTATATTTTTTTCAGATAGCCAGTTATCCTAATGCTCCCTTGATTTGA 
TGGACCACCTGGATCACACATTATGAGCCCCCTCATAAGCAGGTGGGAGTCTCAAGCGAG 
GGCCAGTCCCGATGGGAATAGCACTTGGTGGCTGAGGACCCTCCTATCTGTGCAGACACT 
GTTGTAAAACTTCACATGCATCATCTAATTTAGTCCTCACCAAAATCCTATGAAATGTAG 

14 996 CCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCGTGATCCACCCGCCTTGGCCTCCC 
AAAGTGCTGGGACTACAGGCGTAAGCCACTGTGCTCGGCCCTATATTTTTTTCAGATAGC 
CAGTTATCCTAATGCTCCCTTGATTTGATGGACCACCTGGATCACACATTATGAGCCCCC 
TCATAAGCAGGTGGGAGTCTCAAGCGAGGGCCAGTCCCGATGGGAATAGCACTTGGTGGC 
TGAGGACCCTCCTATCTGTGCAGACACTGTTGTAAAACTTCACATGCATCATCTAATTTA 
[G,A] 

TCCTCACCAAAATCCTATGAAATGTAGGAATGATCATTACACCCATTTATAGATAAGGAA 
ACGGAGGGACAGGGAGATTACTCCGCTACAGGTCAAGAGGCAGGGAAGTAGAGCTGCGAT 
TTGAACTGAGGTCTGTGTCTAGAACACGTGCTCATTCTTTCCCTAAAATGTATTCATAGG 
TGAAAAAGGGCTTCTGCGGAAAGCCCTGGGTTATGTGGGAAACCCTGGATTTACAGCTGT 
CTTTCCAGCAGGATGATGCAGGAGAGAGAGGGATGCGATTTCTCCCAATCTCTCCTGGTC 

16153 CGCCCCTCCAGCCCCGGCCCCGCCCTCCCTTCTGGCCCCGCCTCTGCCAGAGCCCTTCTC 
AAGCCAGGAAAACCTGGTAATTCTATTTGCCTCTCCTCCTGTGGTTCTGCCCGGGGCCCT 
GAGGCGGGCTCTAAAGCCCTAGTCTCACCCTCAAGAAGGAAGAAGTAGAGTCATCACCTC 
TAAATCCCTCCTCCCACCACGGCCCCTCCTCTATTGCAGATCCTGGGCATCTACAAGCAG 
GGCCTCAAATGCCGAGGTGAGATGGAATGACTGGAAGGGCTGCTGGGCAGTGTTTTTTTT 
[T,G] 

TTTGTTTGTTTGTTTGGGAGAGTTACTATTTTGGTGGGGCAATTGCCAAGGAGTGAAGTA 
CCTTAAAATCAGAGGCGCATGGCCGGGCATGGTGGCTCAAGCCTGTAATCCCAGCACTTT 
GGGAGGCCGAGGCGCGCAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGACCAACAT 
AGCGCAACCCCGCCTCTACTAAAAATACAAAAAGTAGCTGGGCGTGGTGGCACCCACCTG 
TAATCCCAGCTACTTGGGAGGCTGAGGCATGAGAATCGCTTGAACCTGGGAGGCGGGGTT 
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16181 CTTCTGGCCCCGCCTCTGCCAGAGCCCTTCTCAAGCCAGGAAAACCTGGTAATTCTATTT 
GCCTCTCCTCCTGTGGTTCTGCCCGGGGCCCTGAGGCGGGCTCTAAAGCCCTAGTCTCAC 
CCTCAAGAAGGAAGAAGTAGAGTCATCACCTCTAAATCCCTCCTCCCACCACGGCCCCTC 
CTCTATTGCAGATCCTGGGCATCTACAAGCAGGGCCTCAAATGCCGAGGTGAGATGGAAT 
GACTGGAAGGGCTGCTGGGCAGTGTTTTTTTTGTTTGTTTGTTTGTTTGGGAGAGTTACT 
[G,A] 

TTTTGGTGGGGCAATTGCCAAGGAGTGAAGTACCTTAAAATCAGAGGCGCATGGCCGGGC 
ATGGTGGCTCAAGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGCGCAGATCACCTGA 
GGTCAGGAGTTCAAGACCAGCCTGACCAACATAGCGCAACCCCGCCTCTACTAAAAATAC 
AAAAAGTAGCTGGGCGTGGTGGCACCCACCTGTAATCCCAGCTACTTGGGAGGCTGAGGC 
ATGAGAATCGCTTGAACCTGGGAGGCGGGGTTTGCAGTGAGCCGAGATCACGCCACTGCA 

16756 CAGTGAGCCGAGATCACGCCACTGCACTCCAGCCTGGGCAACAGAGAGGGCTCTGTCTCA 
AAAAAAAAAAACAACAAAAAAACCCCCAAAACCAAAACCCCACAAAATCAGAGGCTCAAG 
ATGACTGATGTGAAGGGAGTGGCGTTTAAGAGGCCATTTATTTTGATGACGCAGCTGCCC 
AGGAACAGAGAACATGGGAGAAGGCATAGACTGACAATTAGGAGGAGGAGAACACTTTGG 
AAGGAGACTCTTATTTTGGTGGGGCAGCTGCTCAGGAACAAAGGTTCCTGGTAGGGGGGC 
[A, G] 

CAAGCCTGCGGGATGGGATGGAGGGTATTCTGACCAATGTCCCTGGCTGGCTCTCCATTT 
GCTCTCCCCCAGCCTGTGGAGTGAACTGCCACAAGCAGTGCAAGGATCGCCTGTCAGTTG 
AGTGTCGGCGCAGGGCCCAGAGTGTGAGCCTGGAGGGGTCTGCACCCTCACCCTCACCCA 
TGCACAGCCACCATCACCGCGCCTTCAGCTTCTCTCTGCCCCGCCCTGGCAGGCGAGGCT 
CCAGGCCTCCAGGTAAGAGGGAGTCATTCTGTACTGGCCTGTGGAGGGAAGGATGCAGGG 

18059 AATGATTATTTTGCTGAGAACAGTCCGAACAACTATGTTAAACTGGGGTCTAAGGTAGTT 
GATCACAACTGTTTGGGTTGGCATAAGTCCTCAAAAAACAGAGGCAGGCACAGGGCATAC 
ATCCTCAAAAATAGAAAAGATAAATCCATTTGCATTGAGCCTTCCAGAAGTGCTGGGGTC 
TAAAATGTGAAATACACACAAAATTGACATTTAAGCAAACTGCGCTGACAAATCTGTGGC 
TGAAAAAGCTGTGGCAAAACAAAAACATAGAAAAAGAGCCTCAAAAATTGGGCTGAGGCC 
[A, G] 

GGCATGGTGGCTCACGCCTGTAATCCTAGCACTTTGGGAAGCCAAGGTGGGTGGATCACC 
CGAGGTCAGGAGTTGGAGACCAGACTGGCCAACGTGGCAAAACATCATCTCTACAATACA 
AAAATACAAAAATTAGCTGGGCGTGGTGGCAGGCGCCTGTAATCCCAGCTACTTGGGAGG 
CTGAGGCACGAGAATCGCTTGAACCTGGGAGGTGGAGGTTGCAGAGAGCCGAGATTGCGC 
CATTGCACTCCAGCCTGGGCGACAGAGAGAGACTCTGTCTCAAAAAAAAAAAAAAAAAAA 

183 64 TGGTGGCTCACGCCTGTAATCCTAGCACTTTGGGAAGCCAAGGTGGGTGGATCACCCGAG 
GTCAGGAGTTGGAGACCAGACTGGCCAACGTGGCAAAACATCATCTCTACAATACAAAAA 
TACAAAAATTAGCTGGGCGTGGTGGCAGGCGCCTGTAATCCCAGCTACTTGGGAGGCTGA 
GGCACGAGAATCGCTTGAACCTGGGAGGTGGAGGTTGCAGAGAGCCGAGATTGCGCCATT 
GCACTCCAGCCTGGGCGACAGAGAGAGACTCTGTCTCAAAAAAAAAAAAAAAAAAAAAAA 
[A,-,T] 

TGGGCTGTGAGGTCATGCAGGGAATTGATTTTTGGTGGGTGGGTCTGCTTCTGGGATGAT 
GTGGATGCCTCCCGTGGAGAGGGGAAGGGTTGATGAAGTCCCAGGGACCTGGAAGTGTGT 
TCTGCAGCAATCCCCCTCCCAGCAGAGATCCGTGAGGAGGAGGTACAGACGGTGGAGGAT 
GGGGTGTTTGACATCCACTTGTAATAGATGGTGAGTCCTCCCACAGCTGGCACCAGAGCT 
CCCCACTGAGGGCTGGGGGGGAGCTGGGGAGTATCAGGGAAATGGGTGCTTTATCCAAAT 

18861 ACTTGTAATAGATGGTGAGTCCTCCCACAGCTGGCACCAGAGCTCCCCACTGAGGGCTGG 
GGGGGAGCTGGGGAGTATCAGGGAAATGGGTGCTTTATCCAAATGGCTCCAAGCCAGGTG 
GGCTACTACCTTGTTGTTAGGGGGGTGTCTTCCTCACAACCTGTTTTTCTCTTCCCAGCT 
GTGGTTGGATCAAGGACTCATTCCTGCCTTGGAGAAAATACTTCAACCAGAGCAGGGAGC 
CTGGGGGTGTCGGGGCAGGAGGCTGGGGATGGGGGTGGGATATGAGGGTGGCATGCAGCT 
[G,A] 

AGGGCAGGGCCAGGGCTGGTGTCCCTAAGGTTGTACAGACTCTTGTGAATATTTGTATTT 
TCCAGATGGAATAAAAAGGCCCGTGTAATTAACCTTCACCATCAGCGCCTAGAATCCCGG 
GGGGTAGGGGGATGGTATACTTTACAGGATGACAATCTTGGGAGCTAGAACTTTGTAGCC 
AGAGAAACTTGGGAGGTCTGGAATCTCATGTGTCTGGAGTCTTGGGGAAGAGAATCTTAG 
AAGCAGAAAACCTTGGAACATAAGAATCTTGGGGAGGGTCTAGGATCTTGAGGAGACCAG 

204 4 3 TGTTGTGAAGAGCAGCTCGCTCCTGTGCCGCCTGCCTCCTGTGCTGCCTCCATCCCTGCA 
GCCCAGTCGGTTCCTCTTGGCTCCTCTCGTCACTACCCTCCAGTTCCAGTCTGGCCTCTT 
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» * 

CCTGGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATGCATGCATGCATATGTGTGTCC 
AGGTCTGCCTGCCCGGGATGTGACAAGTAGCGGTCTTCATGGTTGCATGTGTCTGAATTT 
GGTGTCTGAGCTTCACATTGTATGCGCCTGTGTGCATGTGTGTGCATGGACATGCATGCT 
[G, A] 

TATCTGCTGTGTTTCCCCTCCCCCATGTGTCCCCACTGGCCTTTGCACATGGGAGAAGGG 
CATGTGCTCAGCATATCACTCAACTGTCCACATTGGGTGGGTACCTGTGTGTGGTGTGTG 
TGTGTGGGGGGTGTGTCTTGAAGTGGCAGGTCCCAAATGCTTAGGCAATCTGAACCTTGG 
ACCTTGCAGAGAGGAGAGATGTCCCTGTAGGTGGGAGGGACAGGGAGATGCAGCAGCTGC 
CCGGTGACCTTTTCTGCCCTTGATGGGCAAAGCTGGGGGTAGGGAAAGGAGACAAGTGCT 

20881 TTGAAGTGGCAGGTCCCAAATGCTTAGGCAATCTGAACCTTGGACCTTGCAGAGAGGAGA 
GATGTCCCTGTAGGTGGGAGGGACAGGGAGATGCAGCAGCTGCCCGGTGACCTTTTCTGC 
CCTTGATGGGCAAAGCTGGGGGTAGGGAAAGGAGACAAGTGCTCATACTTACCTCCCTCC 
CTGCCCAGGCTCCTCTGTAAGGGTCTGAGTCTGTCTCTGTGAGCCATTGCATCTGTCTGT 
CTATGCCCTGATGCCTGGATGGACAAGGGGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 
[A,T] 

GTGTGAGGCTGCAGGAAGAGGAACAGTGGGGGATGGGCAGGAAAGTGGGCTGTGGGGTCA 
GGGAGGCGAT 



FIGURE 3, page 12 of 12 



